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RA antigenic peptides 



The present invention provides novel naturally-processed RA antigenic peptides 
which are candidate markers for erosive and non-erosive RA. These antigenic peptides are 
presented by human MHC class II HLA-DR molecules. Moreover, these antigenic peptides 
linked to MHC class II molecules, as well as antibodies reactive with said antigenic 
peptides, nucleic acids encoding said antigenic peptides, and nucleic acid constructs and 
host cells for expressing said antigenic peptides are provided. The antigenic peptides of the 
invention as well as the polypeptides they are derived from can be used as markers in 
diagnosis of RA and in therapy as aiiti-RA vaccines. 



Rheumatoid Arthritis (RA), originally termed chronic polyarthritis, is a systemic 
autoimmune disease and one of the most debilitating forms of articular inflammation 
(Feldmann, M. et al., Cell 85 (1996) 307-310; Dedhia, H.V. & DiBartolomeo, A., Critical 
care clinics 18 (2002) 841-854). Typically, RA causes joint pain, deformities and severe 
joint stiffness. The disease can also have its manifestation outside the joints, especially in 
patients who are positive for an autoantibody, termed "rheumatoid factor" (RF) (Mageed, 
R.A., in: van Venrooij, W.J. 8c Maini, R.N. eds., Manual of biological markers of disease, 
Kluwer Academic Publishers (1996) 1-18). RA occurs quite frequently in the Caucasian 
population with the susceptibility to RA being influenced by genetic and environmental 
factors. Both have a crucial effect on the onset and the progression of this autoimmune 
disease. Approximately 4% of the total population has an increased genetic susceptibility to 
RA, roughly 20% of which (around 1% of the total population) develops RA as a result of, 
as yet, uncharacterized non-inheritable factors. Beyond that, RA shows a significant bias in 
the sex ratio: women have a three fold higher risk for RA than men, indicating that sex 
hormones may also be involved in the pathogenesis. 

In the beginning, RA progresses slowly. Typical early stage symptoms are palm 
sweating, morning stiffness of fingers and symmetrical joint inflammation. In addition, 
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rheumatoid nodules can appear which is an indication for tissue affection outside the 
joints. In a simplified model, the immune system produces autoantibodies against healthy 
tissue. These autoantibodies attack the articular cartilage in the joint leading to its 
inflammation and later on to its destruction. This destruction stimulates the immune 

5 system to produce more autoantibodies. In addition, cytokines like tumor necrosis-factor 
alpha (TNF-a) and Interleukin-1 (IL-1) are produced which enhance the inflammatory 
reaction even further (Houssiau, F.A., Clin Rheumatol 14 Suppl 2 (1995) 10-13). The 
synovium begins to swell due to infiltration of additional cells of the immune system, such 
as macrophages and T cells. These cells are actively involved in causing further cell death 

10 and in driving joint inflammation (Fox, D.A., Arthritis Rheum 40 (1997) 598-609; Choy, 
KH. & Panayi, G.S., N Engl J Med 344 (2001) 907-916). This process resembles a vicious 
circle of autoantibody production, joint inflammation and joint destruction. 

Typically, RA progresses chronically, with 85-90% of all RA patients showing a mild 
to moderate disease development. Aggressive disease forms leading to complete loss of joint 
15 function up to the degree of invalidity is experienced by 10-15% of the patients. In this 
advanced RA state, patients have a permanent articular inflammation and display 
rheumatoid nodules. They suffer from strong chronical pain and the inflammation leads to 
severe finger stiffness and irreversible joint deformations or dislocations. 

20 Diagnosis 

There is growing evidence that therapeutic intervention early in the disease can 
reduce the extent of joint damage (Egsmose, C. et al., J Rheumatol 22 (1995) 2208-2213; 
Van der Heide, A. et al., Ann Intern Med 124 (1996) 699-707). Since treatment with 
disease-modifying antirheumatic drugs (DMARDs) is only justified when the riskbenefit 

25 or cost:effectiveness ratios are favorable, it is mandatory to be able to differentiate between 
RA and other forms of arthritis shortly after onset of the disease (Kirwan, J.R. & Quilty, B., 
Clin Exp Rheumatol 15 (1997) 15-25). The diagnosis is made by established criteria based 
on clinical history, physical examination and laboratory tests. The American Society of 
Rheumatism published a catalog of criteria to help gaining objective evidence for RA 

30 (Arnett, RC. et al., Arthritis Rheum 31 (1987) 315-324). But so far, not a single test is 
available which is specific for RA. Several biological and biochemical markers, e.g. C- 
reactive protein (CRP), erythrocyte sedimentation rate (ESR), antinuclear antibody (ANA) 
or RF are utilized for the evaluation of RA. However, these markers are non-specific, as 
they appear in other inflammatory or autoimmune diseases as well. The RF, for instance, is 

35 an autoantibody that is present in the serum of approximately 50% of RA patients. Since 
increased levels of the same autoantibody can also be found in the context of other 
inflammatory diseases, such as Sjogren syndrome, endokarditis or chronical hepatitis, RF is 
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unsuitable to serve as a diagnostic marker for RA. Rather than being of diagnostic value per 
se, the above mentioned biochemical and biological markers are useful for assessing disease 
activity and prognosis as well as in the treatment and management of RA patients 
(Nakamura, R.M., J Clin Lab Anal 14 (2000) 305-313). 

5 Recently, a diagnostic set of criteria was developed that consists of clinical and 

biochemical aspects which were claimed to discriminate, at an early state, between self- 
limiting, persistent non-erosive, and persistent erosive RA (Visser, H. et al., Arthritis 
Rheum 46 (2002) 357-365). Self-limiting arthritis was characterized by natural remission: 
there was no arthritis on examination in a patient for a certain period of time. Erosive 

10 arthritis was defined based on the presence of erosions on radiographs of the hands and/or 
feet. In particular, the use of antibodies recognizing cyclic citrullinated peptides appears to 
be promising and suggests an important role for citrullinated antigens in the early diagnosis 
and prognosis of erosive RA (ScheUekens, G.A. et al., J Clin Invest 101 (1998) 273-281; 
Vincent, C. et al., J Rheumatol 25 (1998) 838-846). The early recognition of erosive RA 

15 allows early intervention with DMARDs, which will lead to earlier disease control and 
improvement of disease outcome (Symmons, D.P.M. et al., J Rheumatol 25 j(199&) 1072- 
1077; Anderson, J.J. et al., Arthritis Rheum 43 (2000) 22-29). Likewise, early recognition of 
self-limiting and non-erosive arthritis will prevent unnecessary treatment with potentially 
toxic therapeutics (Fries, J.F. et al., Arthritis Rheum 36 (1993) 297-306. 

20 Therapy 

The goal of any anti-rheumatic therapy is to relieve pain in order to ease the activities 
of every day life. So far, complete healing of RA is not possible, but by applying modern 
therapies the progression of the disease can be slowed down or even stopped. Due to 
individual differences, each patient requires an individualized therapy and early diagnosis, 

25 as mentioned before, is desirable. RA therapy is complex and includes lifelong medicinal 
treatment as well as physio- and radiotherapy. DMARDs used in RA therapy are basic 
therapeutics (e.g. Methotrexate, Sulfasalazin, Hydroxychloroquin, Leflunomid, 
Azathioprin), cortisone, non-steroidal anti-inflammatory drugs (NSAID) or monoclonal 
antibodies against the pro-inflammatory cytokines TNF-a, IL-lp or their respective 

30 receptors (http://rheuma-online.de). These drugs have all in common that they are 
inhibitors of inflammation by suppressing the immune response. The main disadvantage is 
their lack of specificity for RA, their adverse effects and their inability to effectively target 
the causes of RA. 
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Autoimmunity starts when a specific adaptive immune response is initiated against 
self antigens (autoantigens) manifested by the development of self-reactive T or B cells. The 
normal consequence of an adaptive immune response against a foreign antigen is the 
clearance of the antigen from the body. When an adaptive immune response develops 
against a self antigen, however, the antigen can in most cases not be completely removed 
from the body, leading to a sustained immune response. As a consequence, the effector 
mechanisms of immunity cause chronic inflammatory injury to tissues. The mechanisms of 
tissue damage are essentially the same in autoimmune disease as those that operate in 
protective immunity and in hypersensitivity. Even though it is not well understood what 
triggers autoimmunity, several events which are nowadays believed to contribute to the 
induction of autoimmune diseases and selection of autoantigenic targets have been 
summarized most recently (Marrack, P. et al., Nat Med 7 (2001) 899-905). 

Autoimmune diseases are controlled by properties of particular genes of each 
individual and environmental factor. The host's genes affect the susceptibility to 
autoimmunity at least at three levels. First, some of the genes affect the overall reactivity of 
the immune system and, thus, can predispose the individual to certain or to several 
different types of autoimmune diseases. Second, this altered immunoreactivity is funneled 
to particular autoantigens and tissues by other genes that affect recognition of antigenic 
peptides by T cells. Third, still other genes act on the ability of target tissues to modulate 
immune attack for instance by influencing the activity of effector cells of the immune 
system which are destined to initiate an autoaggressive attack. The latter two sets of genes 
dictate which antigens will be the targets of autoimmunity and hence which organs will be 
attacked and what damage will occur. 

In addition, signals from the environment influence the development of 
autoimmunity at the same three levels, by affecting the overall reactivity of the immune 
system, the antigen-specificity and the state of the potential target tissue. And finally, there 
is cross-talk between genetic and environmental factors. 

Major histocompatibility complex (MHC) 

Population studies, genotyping and modern approaches at the molecular level have 
unanimously shown that certain genes encoded by the major histocompatibility complex 
(MHC) confer a significantly higher risk for the development of RA (Stastny, P., Tissue 
Antigens 4 (1974) 571-579; Wordsworth, P. et al., PNAS 86 (1989) 10049-10053; 
Wordsworth, P. & Bell, J., Springer Semin Immunopathol 14 (1992) 59-78). In particular, 
the class II MHC alleles HLA-DKB1*0101, *0401, *0404 and *0405 in several ethnic groups 
increase the susceptibility to RA (Reveille, J., Curr Opin Rheumatol 10 (1998) 187-200). 
E.g. more than 90% of RF-positive RA patients carry one of these susceptibility alleles. HLA 
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class II molecules are MHC-surfkce proteins that bind antigenic peptides within the cell 
and present them on the surface of antigen-presenting cells for interaction with the T cell 
receptors of CD4 + helper T lymphocytes, thereby initiating a cellular immune response 
(Banchereau, J. & Steinman, R.M., Nature 392 (1998) 245-252). The RA-association of 
5 particular HLA class II molecules together with the presence of large numbers of activated 
CD4 + T cells in synovial tissue has supported the model of disease induction in which 
disease-associated HIA-DR molecules present disease-relevant (e.g. synovial) autoantigens 
and cause stimulation and expansion of synovial T cells, which then drive the inflammatory 
process (Striebich, C.C. et aL, J Immunol 161 (1998) 4428-4436). 

10 MHC class II HLA-DR (short: DR) proteins are heterodimers consisting of 

monomorphic a- and extremely polymorphic (3-chains that bind peptide antigens in a 
peptide binding groove. This groove generally has four major pockets to accept side chains 
at relative positions 1, 4, 6 and 9 of the peptide (Stern, L.J. et aL, Nature 368 (1994) 215- 
221). The allelic variations between HLA class II molecules account for the differential 

15 ability to bind antigenic peptides. This is the rationale why individuals differing in their 
HLA alleles have divergent antigenic peptide repertoirs, thereby leading to differences in 
the quality of immune responses (Messaoudi, L et aL, Science 298 (2002) 1797-1800). 

Peptides bound by class II MHC molecules are typically longer and more 
heterogeneous in size (11-25 amino acids) than the peptides bound by class I MHC 
20 molecules (8-10 amino acids). This difference arises because the peptide binding groove of 
class II proteins is open and while peptides are gripped in the midle, their ends can extend 
out of the groove in a variable fashion (Jones, E.Y., Curr Opin Immunol 9 (1997) 75-79). 
As a consequence, class II molecules typically bind sets of overlapping peptides that share a 
common core sequence, termed "T cell epitope", but have different lengths. 

25 More than a decade ago, it was recognized that the DRJ3 chains encoded by RA-linked 

DRB1 alleles, although exhibiting polymorphic differences, all share a stretch of identical or 
almost identical amino acids at positions 67-74, known as the "shared epitope* (Gregersen, 
P.K. et aL, Arthritis Rheum 30 (1987) 1205-1213). Since immunity to autoantigens has 
been regarded central to the pathogenesis of RA, it was hypothesized that the shared 

30 epitope could impose disease linkage on the respective DR molecules by at least two 
different mechanisms: first, by selecting the relevant autoantigenic peptides for 
presentation, and second, by selecting the appropriate autoreactive Tcell specificities 
during ontogeny. The three-dimensional structure of DR molecules has indeed revealed 
that the shared epitope is located in the center of the a-helix flanking one side of the 

35 peptide binding groove (Stern, L.J. et aL, Nature 368 (1994) 215-221). Thus, strategically 
this shared epitope region is positioned in such a way that it can interact with both bound 
peptide and T cell receptor. 
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However, one of the unresolved mysteries in rheumatology research is the question 
what are the key arthritogenic antigens and epitopes in man that trigger the onset and the 
development of RA. Although autoantibodies of different specificity have been identified in 
serum and synovial fluid of patients it is often unclear whether the antigens which were 
5 released at the time of cartilage degradation, were initiating pathogenicity or whether they 
are merely a consequence of antigen spreading as a result of inflammation (Corrigall, V.M. 
& Panayi G.S., Crit Rev Immunol 22 (2002) 281-293). Furthermore it is difficult to define 
pathogenic mechanisms in which the antigen is present throughout the body, including the 
joint, but the pathology is targeted solely or predominately to the joint 

10 Autoantigens 

The large number of possible autoantigens in RA is derived from studies using sera 
or, less frequently, T cells from patients with established chronic RA. One of the most 
convincing joint-specific antigen that has been proposed in the context of DR molecules, is 
type II collagen (CII), the predominant protein in articular cartilage. Autoantibodies 

15 against CII were found in elevated concentrations in the serum and joints of RA patients 
although it is not yet clear whether anti-CII antibodies are pathogenic in RA (Banerjee, S. et 
al., Clin Exp Rheumatol 6 (373-380). Snowden and coworkers have shown that peripheral 
blood T cells from RA patients proliferated to CII, most pronounced in those patients with 
anti-CII antibodies. However, the response was seen only in 50% of patients (Snowden, N. 

20 et al., Rheumatology 40 (1997) 1210-1218). In a mouse model immunization with CII was 
shown to induce arthritis in mice expressing the class II MHC alleles DRB1*0401 and *01 01 
(Rosloniec, E.F. et al., J Exp Med 185 (1997) 1113-1122; Rosloniec, E.F. et al., J immunol 
160 (1998) 2573-2578). The immunodominant epitope in both *0401 and *0101 transgenic 
mice was localized to peptides within residues 261-273 of human CII (Fugger, L. et al., Eur 

25 J Immunol 26 (1996) 928-933). The same epitope of CII was capable of stimulating a T cell 
response in RA patients, particularly in the early stages of disease. Synovial fluid T cells 
were especially responsive (Kim, H.Y. et al., Arthritis Rheum 42 (1999) 2085-2093). 

Although other cartilage proteins have been proposed as RA candidate antigens, 
DR4-binding epitopes have been defined only for human cartilage glycoprotein 39 

30 (HCgp39). This protein is secreted by synovial cells and articular chondrocytes and its 
expression is upregulated in plasma and joints during inflammation (Vos, K. et al., Ann 
Rheum Dis 59 (2000) 544-548). Similar to CII, HCgp39 treatment induces arthritis in 
mice. In addition a HCgp39 response of peripheral blood T cells from RA patients was 
detected (Verheijden, G.F. et al., Arthritis Rheum 40 (1997) 1115-1125). The predominant 

35 epitope recognized by T cells in DR4 patients was defined between residues 263-275 and 
identical to the immunodominant epitope found in DRB1 *0401 -transgenic mice after 
immunization with native HCgp39 (Cope, A.P. et al., Arthritis Rheum 42 (1999) 1497- 
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1507). Although not disease specific, responses to this peptide did correlate with disease 
activity in RA patients (Baeten, D. et al., Arthritis Rheum 43 (2000) 1233-1243). Antibodies 
to HCgp39, however, have also been detected in the sera of patients with inflammatory 
diseases, such as inflammatory bowel disease and systemic lupus erythematosus (SLE), 
5 albeit at a lower level than in RA. 

In an attempt to track antigen-specific T cells in RA, soluble peptide-DR4 tetrameric 
complexes were used to detect synovial CD4 + T cells reactive with CII or HCgp39 in DR4 + 
patients (Kotzin, B.L. et al., PNAS 97 (2000) 291-296). The CII-DR4 complex bound in a 
specific manner to CII peptide-reactive T cell hybridomas, but did not stain a detectable 
10 fraction of synovial CD4 + cells. Almost similar results were obtained with the HCgp39-DR4 
complex suggesting that the major oligoclonal CD4 + T cell expansions present in RA joints 
are not specific for the dominant CII and HCgp39 determinants described above. 

In summary, despite some strong indications for a CII and HCgp39 association with 
RA, the evidence that they are important antigens in RA is scanty. A direct proof that 
15 peptides of CII or HCgp39 are presented in a class II MHC-restricted manner by antigen- 
presenting cells with subsequent stimulation and activation of synovial CD4 + T cells is still 
lacking. Furthermore a major problem of animal models is their unknown relevance to RA 
as Cll-induced arthritis by immunizing rats or mice differs in many respects from RA. 

Naturally processed MHC class Il-associated peptides 

20 An alternative strategy to the identification of RA-specific autoantibodies and T cells 

relies on the sequence analysis of naturally processed peptide antigens bound to MHC class 
II molecules. With the help of monoclonal antibodies, class II MHC molecules conferring 
susceptibility to RA can be purified from cognate cells. RA-associated peptide antigens can 
be acid-eluted from purified HLA class II molecules. The mixture of small peptides can be 

25 separated by HPLC and the peptide sequence be determined by Edman sequencing or mass 
spectrometry. Due to limitations with peptide purification and sequencing techniques, 
peptide sequences were, as yet, only obtained from MHC molecules that have been isolated 
from cultured B cell lines or large amounts of tissue, and the analysis was restricted to a few 
abundant peptides (Kropshofer et al., J.Exp.Med. 175 (1992) 1799-1803; Chicz, R.M. et al., 

30 J Exp Med 178 (1993) 27-47). As a result of the development of high-resolution 
microcapillary HPLC columns and more sensitive mass spectrometers, MHC-bound 
peptides can be analyzed more efficiently (Dongre, A.R. et al., Eur J Immunol 31 (2001) 
1485-1494; Engelhard, V.H. et al., Mol Immunol 39 (2002) 127-137). 

In the present invention a modified peptide isolation and sequencing technique was 
35 used to investigate the peptide antigen repertoire of HLA-DR4 molecules derived from 
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autologous dendritic cells (DCs) which were pulsed with serum or synovial fluid derived 
from RA patients. The main advantage of this innovative approach is the usage of human 
DCs that are professionals in RA-relevant antigen processing and presentation, instead of 
using transgenic animal models or artificial B cell lines. 

5 DCs are enriched in rheumatoid synovial fluid and tissue and are derived from 

circulating immature precursors (Thomas, R. et aL, J Immunol 152 (1994) 2613-2623). 
They are the most potent antigen-presenting cells which express high levels of MHC 
molecules together with a variety of accessory molecules (Mellman, L et aL, Trends Cell 
Biol 8 (1998) 231-237). In a most recent study, it was shown that ex vivo differentiated 

10 human DCs and macrophages that are phenotypically similar to antigen-presenting cells 
from RA synovial joints, were capable of generating and presenting immunodominant 
epitopes from CH and HCgp39 (Tsark, KC. et aL, J Immunol 169 (2002) 6625-6633). DC 
have the capacity to prime CD4 + helper T cells and to effectively activate cytotoxic CD8 + 
T cells (Ridge, T. et aL, Nature 393 (1998) 474-478). Thus, peptides bound to MHC class II 

15 molecules and presented by DCs play a superior role in the pathogenesis of diseases 
involving T cell-driven immune responses. 

Therefore, the problem posed by the lack of knowledge of MHC class II restricted 
antigenic peptides for RA is solved by providing novel naturally-processed MHC class II 
associated RA antigenic peptides and the polypeptides they are derived from as markers for 
20 RA. 

The present invention" provides novel naturally-processed antigenic peptides which are 
candidate RA markers in erosive and non-erosive RA. These antigenic peptides are 
presented by human MHC class II HLA-DR molecules derived from dendritic cells which 
were pulsed with serum or synovial fluid derived from patients with established erosive or 

25 non-erosive RA. The MHC class II antigenic peptide of the invention are comprising (a) at 
least the amino acid sequence of the peptide binding motif selected from the group 
consisting of SEQ ID NOs. 49 to 57 and SEQ ID NOs. 103 to 122, or (b) at least the amino 
acid sequence of the peptide binding motif selected from the group consisting of SEQ ID 
NOs. 49 to 57 and SEQ ID NOs. 103 to 122 with additional N-and C-terminal flanking 

30 sequences of a corresponding sequence selected from the group consisting of SEQ ID NOs. 
1 to 39 or SEQ ID NOs. 58 to 102, and originate from interferon-y-inducible lysosomal 
thiol reductase, integrin beta-2, phosphatitylinositol-4,5-bisphosphate 3-kinase, urokinase- 
type plasminogen activator, immunoglobulin heavy chain V-III region (Vh26), DJ-1 
protein, apolipoprotein B-100, 26S proteasome non-ATPase regulatory subunit 8, 

35 interleukin-1 receptor, fibromodulin, GM-CSF/IL-3/IL-5 receptor, sorting nexin 3, inter-a- 
trypsin inhibitor heavy chain H4, complement C4, complement C3 (oc-chain), complement 
C3 (P-chain), SH3 domain-binding glutamic acid-rich-like protein 3, interleukin-4- 
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induced protein 1, hemopexin, Hsc70-interacting protein, invariant chain (Ii), retinoic acid 
receptor responder protein 2, fibronectin, cathepsin B, tripeptidyl-peptidase II, legumain, 
platelet activating factor receptor, poly-alpha-2.8-sialyltransferase, and ras-leated protein 
Rab-1 IB. The present invention also provides these antigenic peptides and the proteins 
5 they are derived from as markers for erosive and/or non-erosive RA. Moreover, these 

antigenic peptides linked to MHC class II molecules, as well as antibodies reactive with said 
antigenic peptides, nucleic acids encoding said antigenic peptides, and nucleic acid 
constructs, host cells and methods for expressing said antigenic peptides are provided. 
Further methods are provided for isolating and identifying RA antigenic peptides. 

10 

Fig. 1: Diagram of Dendritic cell (DC)-mediated analysis of tissue samples: 

: 

Dendritic cells (DCs), the most specialized antigen-presenting cells (APCs), are brought in 
contact with an antigen source (e.g. synovial fluid) under optimal conditions for antigen 
uptake and antigen processing. As a control, DCs are cultured under the same conditions 
15 in the absence of synovial fluid antigens. After maturation of DCs, antigen-loaded MHC 
class II molecules are purified and the respective MHC class II-associated antigenic 
peptides are isolated and identified. 

Fig. 2A: ION-TRAP MS Base Peak Chromatogram of MHC class II-associated 
antigenic peptides that were isolated from dendritic cells pulsed with the serum of a RA 
20 patient. The peptides were eluted directly from a RP-C18-HPLC column into the ion trap 
mass spectrometer for immediate MS/MS identification. The numbers indicate the 
retention times (upper value) and the molecular masses (lower value) of the most 
prominent peptide peaks in the mixture at the respective time. 

Fig. 2B: ION-TRAP MS spectrum of antigenic peptides at a retention time of 
25 65.4 min. The marked peak was further fragmented and corresponded to a doubly charged 
peptide ion from the inter-alpha-trypsin inhibitor ITIH4 (cf. table 3). 

Fig. 2C: ION-TRAP MS/MS spectrum of the doubly charged peptide ion at m/z 
977.1. The fragmentation masses, together with the mass of the parent ion, were searched 
against a non-redundant human database by using the SEQUEST algorithm. The retrieved 
30 sequence MPKNVVFVIDKSGSMSGR (one-letter-code) corresponded to the dominant 
epitope ITIH4 (271-288) of the inter-alpha-trypsin inhibitor. The positions of the assigned 
series of N-terminal B-ions and C-terminal Y-ions are marked. 

Fig. 3: Summary of the differential binding capacity of the tested candidate RA 
antigens in the context of binding to the HLA-DRB1 *0401 allele. The putative HLA- 
35 DKB1*0401 binding motif is boxed in grey. and. As a measure for affinity, the peptide 
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concentration was determined that was needed to reduce binding of a fixed amount of 
biotinylated HA(307-319) peptide by 50% (IC 5 o) through competition. The reciprocal (1/ 
IC50) directly correlates with peptide affinity; As a reporter biotinylated HA(307-319) 
peptide from influenza hemagglutinin (Rothbard, J.B. et al., Cell 52 (1988) 515-523) was 
5 included in the study. 



The antigenic peptides of the invention are peptides, which are associated with and 
presented by MHC molecules and thereby can have the potential to activate or tolerize 
10 T cells- Antigenic peptides presented by MHC class II molecules are therefore MHC class II 
associated or MHC class II antigenic peptides, whereas antigenic peptides presented by 
MHC class I molecules are MHC class I associated or MHC class I antigenic peptides. 

Peptides which are derived from proteins that are encoded in the genome of the body 
or an APC are denoted as "self-peptides". The main function of self-peptides presented by. 
15 DCs in the peripheral lymphoid organs is thought to be the induction of T cell tolerance to 
self-proteins. Tolerance is the failure to respond to an antigen; when that antigen is borne 
by self tissues, tolerance is called self tolerance. 

Antigens which are derived from an individual's own body are called "self antigens" 
or "autoantigens". An adaptive immune response directed against self antigens is called an 

20 autoimmune response. Likewise, adaptive immunity specific for self antigens is called 
autoimmunity. Autoreactivity describes immune responses directed against self antigens. 
RA is probably due to an autoimmune response that is based on the involvement of 
autoreactive T cells and/ or autoreactive antibodies. Immunogenic peptide includes, but is 
not limited to, an antigenic peptide capable of causing or stimulating a cellular or humoral 

25 immune response. Such peptides may also be reactive with antibodies. 

Peptides derived from proteins encoded in the genome of bacteria, viruses or other 
foreign invaders and which differ from self-proteins are called "foreign antigenic" or 
"foreign" peptides. They are able to elicit a T cell response against foreign proteins they are 
derived from. 

30 RA antigenic peptides are self-peptides that function as self antigens and as a 

consequence of the disease erroneously trigger autoreactivity against self tissues. 
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The present invention provides a MHC class II antigenic peptide comprising (a) at 
least the amino acid sequence of the peptide binding motif selected from the group 
consisting of SEQ ID NOs. 49 to 57 and SEQ ID NOs. 103 to 122, or (b) at least the amino 
acid sequence of the peptide binding motif selected from the group consisting of SEQ ID 
NOs. 49 to 57 and SEQ ID NOs. 103 to 122, with additional N-and C-terminal flanking 
sequences of a corresponding sequence selected from the group consisting of SEQ ID NOs. 
1 to 39 and SEQ ID NOs. 58 to 102. Preferably, the MHC class II antigenic peptide has a 
length of less than 26 amino acids, more preferably a length of 11 to 25 amino acids. Even 
more preferred is the antigenic peptide of the invention with a length of 11 to 19 amino 
acids. Most preferred is the antigenic peptide of the invention consisting of the peptide 
binding motif comprising the four anchor amino acids. 

The present invention also provides a MHC class II antigenic peptide comprising (a) 
at least the amino acid sequence of the peptide binding motif of SEQ ID NO. 49, or (b) at 
least the amino acid sequence of the peptide binding motif of SEQ ID NO, 49 with 
additional N-and C-terminal flanking sequences of a corresponding sequence selected from 
the group consisting of SEQ ID NOs. 1 to 3. . 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 103, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 103 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NOs. 58 and 
59. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 104, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 104 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 60. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 105, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 105 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 61. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 106, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 106 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 62. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 107, or (b) at least the 
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amino acid sequence of the peptide binding motif of SEQ ID NO. 107 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 63. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 50, or (b) at least the 
5 amino acid sequence of the peptide binding motif of SEQ ID NO . 50 with additional N-and 
C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 5. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 108, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 108 with additional N- 
10 and C-terminal flanking sequences of the corresponding sequence of SEQ ID NOs. 64 to 
67. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 109, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 109 with additional N- 
15 and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 68. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 110, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 110 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NOs. 69 and 
20 70. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. Ill, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. Ill with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 72. 

25 Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 

amino acid sequence of the peptide binding motif of SEQ ID NO. 112, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 112 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 73. 

The MHC class II associated novel antigenic peptides of the invention originate from 
30 interferon-y-inducible lysosomal thiol reductase (SEQ ID NOs. 1 to 3), integrin beta-2 
(SEQ ID NOs. 58 and 59), phosphatitylinositol-4,5-bisphosphate 3-kinase (SEQ ID NO: 
60), urokinase-type plasminogen activator (SEQ ID NO: 61), immunoglobulin heavy chain 
V-III region (V H 26) (SEQ ID NO: 62), DJ-1 protein (SEQ ID NO: 63), apolipoprotein B- 
100 (SEQ ID NOs. 4 and 5), 26S proteasome non-ATPase regulatory subunit 8 (SEQ ID 
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NOs. 64 to 67), interleukin-1 receptor (SEQ ID NO: 68), fibromodulin (SEQ ID NOs.: 69 
and 70), GM-CSF/IL-3/IL-5 receptor (SEQ ID NOs, 71 and 72), sorting nexin 3 (SEQ ID 
NO: 73), inter-a-trypsin inhibitor heavy chain H4 (SEQ ID NOs. 6 to 12), complement C4 
(SEQ ID NOs. 13 to 18), complement C3 (a-chain) (SEQ ID NOs. 19 to 23, 74 and 75), 
5 complement C3 (P-chain) (SEQ ID NOs. 76 and 77), SH3 domain-binding glutamic acid- 
rich-like protein 3 (SEQ ID NOs. 24 to 27), interleukin-4-induced protein 1 (SEQ ID NOs. 
28 to 30), hemopexin (SEQ ID NOs. 31 to 35 and 78), Hsc70-interacting protein (SEQ ID 
NOs. 36 to 39), invariant chain (Ii) (SEQ ID NOs. 79 to 83), retinoic acid receptor 
responder protein 2 (SEQ ID NOs. 84 to 86), fibronectin (SEQ ID NOs. 87 to 91), 
10 cathepsin B (SEQ ID NO: 92), tripeptidyl-peptidase H (SEQ ID NOs. 93 and 94), legumain 
(SEQ ID NO: 95), platelet activating factor receptor (SEQ ID NO: 96), poly-alpha-2.8- 
sialyltransferase (SEQ ID NO: 97), and ras-leated protein Rab-llB (SEQ ID NOs. 98 to 
102). 

The single peptide binding groove of MHC class II molecules is about 25 A long, but 
15 in contrast to MHC class I molecules, both sides are open (Stern LJ et al., Nature 1994; 368, 
215-221). Thus, naturally processed antigenic peptides eluted from human MHC class II 
molecules have a minimal length of about 11 residues and attain a maximal length of about 
25 residues (Chicz RM et al., J Exp Med 1993; 178, 27-47). 

The stability of the MHC-peptide interaction is determined by more than a dozen 

20 hydrogen bonds involving the peptide backbone and the complementarity between 
specificity pockets of the binding groove and appropriately located amino acid side-chains 
of the peptide. The amino acids of the peptide fitting into the respective pockets were 
named "anchor" residues. With regard to most HLA-DR alleles, these anchors are located at 
relative positions PI, P4, P6 and P9. The combination of amino acids at these 4 anchor 

25 positions conferring high-stability binding to the respective HLA-DR allelic product and 
vary from allele to allele. The peptide binding motif is defined herein as the sequence of 
nine amino acids comprising the four anchor amino acids. The peptide binding motif of 
the MHC class II antigenic peptide of the invention is depicted in SEQ ID NO. 49 for the 
peptides derived from interferon-y-inducible lysosomal thiol reductase (SEQ ID NOs. 1 to 

30 3), in SEQ ID NO. 103 for the peptides derived from integrin beta-2 (SEQ ID NOs. 58 and 
59), in SEQ ID NO. 104 for the peptides derived from phosphatitylinositol-4,5- 
bisphosphate 3-kinase (SEQ ID NO: 60), in SEQ ID NO. 105 for the peptides derived from 
urokinase-type plasminogen activator (SEQ ID NO: 61), in SEQ ID NO. 106 for the 
peptides derived from immunoglobulin heavy chain V-III region (V H 26) (SEQ ID NO: 62), 

35 in SEQ ID NO. 107 for the peptides derived from DJ-1 protein (SEQ ID NO: 63), in SEQ 
ID NO. 50 for the peptides derived from apolipoprotein B-100 (SEQ ID NOs. 4 and 5), in 
SEQ ID NO. 108 for the peptides derived from 26S proteasome non-ATPase regulatory 
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subunit 8 (SEQ ID NOs. 64 to 67), in SEQ ID NO. 109 for the peptides derived from 
interleukin-1 receptor (SEQ ID NO: 68), in SEQ ID NO. 110 for the peptides derived from 
fibromodulin (SEQ ID NOs.: 69 and 70), in SEQ ID NO. Ill for the peptides derived from 
GM-CSF/IL-3/IL-5 receptor (SEQ ID NOs. 71 and 72), in SEQ ID NO. 112 for the peptides 
5 derived from sorting nexin 3 (SEQ ID NO: 73), in SEQ ID NO. 51 for the peptides derived 
from inter-a-trypsin inhibitor heavy chain H4 (SEQ ID NOs. 6 to 12), in SEQ ID NO. 52 
for the peptides derived from complement C4 (SEQ ID NOs. 13 to 18), in SEQ ID NO. 53 
for the peptides derived from complement C3 (cc-chain) (SEQ ID NOs. 19 to 23, 74 and 
75), in SEQ ID NO. 113 for the peptides derived from complement C3 (p-chain) (SEQ ID 

10 NOs. 76 and 77), in SEQ ID NO. 54 for the peptides derived from SH3 domain-binding 
glutamic acid-rich-like protein 3 (SEQ ID NOs. 24 to 27), in SEQ ID NO. 55 for the 
peptides derived from interleukin-4-induced protein 1 (SEQ ID NOs. 28 to 30), in SEQ ID 
NO. 56 for the peptides derived from hemopexin (SEQ ID NOs. 31 to 35 and 78), in SEQ 
ID NO. 57 for the peptides derived from Hsc70-interacting protein (SEQ ID NOs. 36 to 

15 39), in SEQ ID NO. 114 for the peptides derived from invariant chain (Ii) (SEQ ID NOs. 79 
to 83), in SEQ ID NO. 115 for the peptides derived from retinoic acid receptor responder 
protein 2 (SEQ ID NOs. 84 to 86), in SEQ ID NO. 116 for the peptides derived from 
fibronectin (SEQ ID NOs. 87 to 91), in SEQ ID NO. 117 for the peptides derived from 
cathepsin B (SEQ ID NO: 92), in SEQ ID NO. 118 for the peptides derived from 

20 tripeptidyl-peptidase II (SEQ ID NOs.93 and 94), in SEQ ID NO. 119 for the peptides 
derived from legumain (SEQ ID NO: 95), in SEQ ED NO. 120 for the peptides derived from 
platelet activating factor receptor (SEQ ID NO: 96), in SEQ ID NO. 121 for the peptides 
derived from poly-alpha-2.8-sialyltransferase (SEQ ID NO: 97) and in SEQ ID NO. 122 for 
the peptides derived from Ras-related protein Rab-llB (SEQ ID NO: 98 to 102). 

25 The peptide binding motif may also comprise at least one, at least two, at least three, 

at least four or at least five modifications of the amino acid sequence while still attaining 
the binding capacity of the non-modified peptide binding motif. Preferably, the modified 
peptide binding motif comprises at least three of the four anchor amino acids of the non- 
modified peptide binding motif The amino acid modification may be a conservative 

30 amino acid substitution as described below. 

Additional binding energy is provided by hydrogen bonds involving residues in front 
of the PI anchor and behind the P9 anchor. In agreement with that, in most naturally 
processed peptides the nonameric core-region (P1-P9) is N- and C-terminally flanked by 
3-4 residues. Hence, the majority of peptides are 15-17-mers. Longer peptides protrude 
35 from the groove, thereby allowing access of exopeptidases which are trimming both ends. 

Therefore, the MHC class II antigenic peptide of the invention comprising (a) at least 
the amino acid sequence of the peptide binding motif selected from the group consisting of 
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SEQ ID NOs. 49 to 57 and SEQ ID NOs. 103 to 122, or (b) at least the amino add sequence 
of the peptide binding motif selected from the group consisting of SEQ ID NOs. 49 to 57 
and SEQ ID NOs. 103 to 122 with additional N- and C-terminal flanking sequences of a 
corresponding sequence selected from the group consisting of SEQ ID NOs. 1 to 39 and 
5 SEQ ID NOs. 58 to 102, preferably comprises additional N- and C- terminal flanking 
amino acid residues providing additional binding energy. 

Preferably, the MHC class II antigenic peptide of the present invention has. a binding 
capacity to the corresponding MHC class II molecule of between one tenth and ten-fold the 
IC50 of a corresponding peptide selected from the group consisting of SEQ ID NOs. 1 to 39 
10 and SEQ. ID NOs. 58 to 102. The binding capacity of a peptide is measured by determining 
the concentration necessary to reduce binding of a labelled reporter peptide by 50%. This 
value is called IC 50 . A MHC class II antigenic peptide of the invention maintains its binding 
capacity to the relevant HLA class II molecules as long as it attains IC50 values between one 
tenth and 10-fold the IC50 of the established reference peptides. 

15 

Since peptide trimming occurs in an individual fashion both before and after binding 
into the peptide binding groove, the occurrence of several truncation variants sharing a 
common nonameric core region is a common feature of MHC class II-bound peptides. 
Importantly, it was shown that C- or N-terminal truncation variants of the same epitope 
20 can trigger divergent T cell responses (Arnold et al., (2002) J. Immunol. 169, 739-749). 

Several parameters can be envisaged that have an influence on the relative abundance 
of truncation variants of a particular epitope, e.g. the abundance and integrity of the 
antigen of relevance, antigen-associated proteins, the abundance of proteases, the type of 
proteases available and the supply with competitive antigens and/or peptides. Since the 
25 antigen supply is a major characteristic that may correlate with the origin of a sample, the 
ratio of particular truncation variants of an epitope can be of diagnostic value. 

A peptide of the invention is a peptide which either has no naturaUy-occurring 
counterpart (e.g., such as an mutated peptide antigen), or has been isolated, i.e., separated 

30 or purified from components which naturally accompany it, e.g., in tissues such as 
pancreas, liver, spleen, ovary, testis, muscle, joint tissue, neural tissue, gastrointestinal 
tissue, or body fluids such as blood, serum, synovial fluid or urine. Typically, the peptide is 
considered "isolated" when a preparation comprising a peptide of the invention consists to 
at least 70%, by dry weight of said peptide and to less than 30% of the proteins and 

35 naturally-occurring organic molecules with which it is naturally associated. Preferably, a 
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preparation of a peptide of the invention consists of at least 80%> more preferably at least 
90%, and most preferably at least 99%, by dry weight, the peptide of the invention. Since a 
peptide that is chemically synthesized is, by its nature, separated from the components that 
naturally accompany it, the synthetic peptide is "isolated". 

5 

The invention further provides analogs of the antigenic peptide of the invention. The 
term analog includes any peptide which displays the functional aspects of these antigenic 
peptides comprising the binding capacity IC 5 o and the recognition by antibodies and cells 
of the immune system. Analogs exhibit essentially the same IC 5 o as the corresponding 
10 reference peptide. The term analog also includes conservative substitutions or chemical 
derivatives of the peptides. 

The term "analog" includes any polypeptide having an amino acid residue sequence 
substantially identical to the sequences described herein in which one or more residues 
have been conservatively substituted with a functionally similar residue and which displays 

15 the functional aspects of the peptides as described herein. Examples of conservative 
substitutions include the substitution of one non-polar (hydrophobic) residue such as 
phenylalanine, tyrosine, isoleucine, valine, leucine or methionine for another, the 
substitution of one polar (hydrophilic) residue for another such as between arginine and 
lysine, between glutamine and asparagine, between threonine and serine, the substitution 

20 of one basic residue such as lysine, arginine or histidine for another, or the substitution of 
one acidic residue, such as aspartic acid or glutamic acid for another. 

The phrase "conservative substitution" also includes the use of a chemically 
derivatized amino acid in place of a non-derivatized amino acid. "Chemical derivative" 
refers to a subject polypeptide having one or more amino acids chemically derivatized by 

25 reaction of a functional side group. Examples of such derivatized molecules include for 
example, those molecules in which free amino groups have been derivatized to form amine 
hydrochlorides, p-toluene sulfonyl groups, carbobenzoxy groups, t-butyloxycarbonyl 
groups, chloroacetyl groups, acetyl groups or formyl groups. Free carboxyl groups may be 
derivatized to form salts, methyl and ethyl esters or other types of esters or hydrazides. Free 

30 hydroxyl groups may be derivatized to form O-acyl or O-alkyl derivatives. The imidazole 
nitrogen of histidine may be derivatized to form N-im-benzylhistidine. Also included as 
chemical derivatives are those proteins or peptides, which contain one or more naturally- 
occurring amino acid derivative of the twenty standard amino acids. For examples: 4- 
hydroxyproline may be substituted for proline; 5-hydroxylysine may be substituted for 

35 lysine; 3-methylhistidine may be substituted for histidine; homoserine may be substituted 
for serine; and ornithine or citrulline maybe substituted for lysine. 
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The MHC class II antigenic peptides of the invention and the proteins they are 
derived from can be used as markers in diagnosis of RA and in therapy as anti-RA vaccines. 
The term marker as used herein refers to a biomolecule, preferably a peptide or a 
5 polypeptide, which is expressed in a group of patients with a diagnosed disease, e.g. RA, 
and attains an abundance that is significantly increased or decreased as compared to a 
control group. 

The marker of the present invention may be used as a prognostic marker to predict 
the susceptibility to a disease, e.g., to predict the susceptibility to RA, as a diagnostic marker 
10 for the diagnosis of a disease, e.g. for the diagnosis of RA, as a differential diagnostic marker 
to differentiate between different forms of a disease, e.g., to differentiate between different 
forms of RA, as a prognostic marker for the prediction of the outcome of a disease, e.g., for 
the prognosis of RA, and as a response marker to determine the efficacy of a therapeutic 
regime, e.g., as a response marker in the treatment of RA. 

15 

In a further embodiment, the MHC class II antigenic peptide comprising (a) at least 
the amino acid sequence of the peptide binding motif selected from the group consisting of 
SEQ ID NOs. 49 to 57 and SEQ ID NOs. 103 to 122, or (b) at least the amino acid 
sequence of the peptide binding motif selected from the group consisting of SEQ ID NOs. 
20 49 to 57 and SEQ ID NOs. 103 to 122, with additional N-and C-terminal flanking 
sequences of a corresponding sequence selected from the group consisting of SEQ ID NOs. 
1 to 39 and SEQ ID NOs. 58 to 102, is used as a marker for erosive and/or non-erosive RA. 

In a further embodiment, the MHC class II antigenic peptide comprising (a) at least 
the amino acid sequence of the peptide binding motif of SEQ ID NO. 49, or (b) at least the 
25 amino acid sequence of the peptide binding motif of SEQ ID NO. 49 with additional N-and 
C-terminal flanking sequences of a corresponding sequence selected from the group 
consisting of SEQ ID NOs. 1 to 3 is used as a marker for non-erosive RA. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 103, or (b) at least the 
30 amino acid sequence of the peptide binding motif of SEQ ID NO. 103 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NOs. 58 and 
59 is used as a marker for non- erosive RA. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 104, or (b) at least the 
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amino acid sequence of the peptide binding motif of SEQ ID NO. 104 with additional N- 
and C-terifiinal flanking sequences of the corresponding sequence of SEQ ID NO. 60 is 
used as a marker for non-erosive RA. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
5 amino acid sequence of the peptide binding motif of SEQ ID NO. 105, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 105 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 61 is 
used as a marker for non-erosive RA. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a). at least the 
10 amino acid sequence of the peptide binding motif of SEQ ID NO. 106, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 106 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 62 is 
used as a marker for non-erosive RA. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
15 amino acid sequence of the peptide binding motif of SEQ ID NO. 107, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 107 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 63 is 
used as a marker for non-erosive RA. 

In a further embodiment, the MHC class II antigenic peptide comprising (a) at least 
20 the amino acid sequence of the peptide binding motif of SEQ ID NO. 50, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 50 with additional N-and 
C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 5 is used as a 
marker for erosive RA. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
25 amino acid sequence of the peptide binding motif of SEQ ID NO. 108, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 108 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NOs. 64 to 67 
is used as a marker for erosive RA. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
30 amino acid sequence of the peptide binding motif of SEQ ID NO. 109, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 109 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 68 is 
used as a marker for erosive RA. 
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Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 110, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 110 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NOs. 69 and 
5 70 is used as a marker for erosive RA. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. Ill, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. Ill with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 72 is 
10 used as a marker for erosive RA. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 112, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 112 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 73 is 
15 used as a marker for erosive RA. 



In a further embodiment, the MHC class II antigenic peptides of the invention as 
20 described above are provided linked to a MHC class II molecule. 

Multimers (e.g., dimers, trimers, tetramers, pentamers, hexamers or oligomers) of a 
class II MHC molecule containing a covalently or non-covalently bound peptide according 
to the present invention, if conjugated with a detectable label (e.g., a fluorescent moiety, a 
radionuclide, or an enzyme that catalyzes a reaction resulting in a product that absorbs or 

25 emits light of a defined wavelength) can be used to quantify T cells from a subject (e.g., a 
human patient) bearing cell surface receptors that are specific for, and therefore will bind, 
such complexes. Relatively high numbers of such T cells are likely to be diagnostic of 
disease or an indication that the T cells axe involved in immunity to the disease. In 
addition, continuous monitoring of the relative numbers of multimer-binding T cells can 

30 be useful in establishing the course of a disease or the efficacy of therapy. Such assays have 
been developed using tetramers of class I MHC molecules containing an HIV-l-derived or 
an influenza virus- 15 derived peptide (Altaian et al. (1996), Science 274:94-96; Ogg et al. 
(1998), Science 279:2103- 21061), and corresponding class II MHC multimers would be 
expected to be similarly useful. Such complexes could be produced by chemical cross- 

35 linking of purified class II MHC molecules assembled in the presence of a peptide of 
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interest or by modification of already established recombinant techniques for the 
production of class II MHC molecules containing a single defined peptide (Kazono et al. 
(1994), Nature 369:151-154; Gauthier et al. (1998), Proc. Natl. Acad. Sci. U.S.A. 95:11828- 
118331). The class II MHC molecule monomers of such multimers can be native molecules 
5 composed of full-length alpha and beta chains. Alternatively, they can be molecules 
containing either the extracellular domains of the alpha and beta chains or the alpha and 
beta chain domains that form the "walls" and "floor" of the peptide-binding cleft. 

The invention also relates to an antibody, fragments or derivatives thereof, directed 
to and reactive with the above-described MHC class II antigenic peptides. The general 

10 methodology for producing antibodies is well known and is disclosed per example in 
Kohler and Milstein, 1975, Nature 256,494 or in J. G. R. Hurrel, Monoclonal Hybridoma 
Antibodies: Techniques and Applications, CRC Press Inc., Boco Raron, FL (1982). The 
antibodies can be polyclonal or, preferably, monoclonal, or antibody fragments like be F 
(ab ! ) 2, Fab, Fv or scFv. The antibodies of the present invention may also be humanized 

15 (Merluzzi S. et al., (2000), Adv. Clin. Path., 4(2): 77-85) or human antibodies (Aujame L. et 
al., Hum. Antibodies, (1997), 8(4): 155-168). 

The present invention also provides a nucleic acid molecule encoding a MHC class II 
antigenic peptide of the invention comprising (a) at least the amino acid sequence of the 
peptide binding motif selected from the group consisting of SEQ ID NOs. 49 to 57 and 
20 SEQ ID NOs. 103 to 122, or (b) at least the amino acid sequence of the peptide binding 
motif selected from the group consisting of SEQ ID NOs. 49 to 57 and SEQ ID NOs. 103 to 
122, with additional N- and C-terminal flanking sequences of a corresponding sequence 
selected from the group consisting of SEQ ID NOs. 1 to 39 and SEQ. ID NOs. 58 to 102. 
Preferably, the nucleic acid molecule is a DNA molecule. 

25 Furthermore, a nucleic acid molecule is provided encoding a MHC class II antigenic 

peptide of the invention linked to a MHC class II molecule. 

This invention also provides a recombinant nucleic acid construct comprising the 
nucleic acid molecules as described above, operably linked to an expression vector. 
Expression vectors suitable for use in the present invention comprise at least one 
30 expression control element operably linked to the nucleic acid sequence encoding the 
antigenic peptide or the antigenic peptide linked to a MHC class II molecule. The 
recombinant expression construct maybe a DNA construct. 

The expression control elements are inserted in the vector to control and regulate the 
expression of the nucleic acid sequence encoding the antigenic peptide of the invention. 
35 Examples of expression control elements include, but are not limited to, lac system, 
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operator and promoter regions of phage lambda, yeast promoters and promoters derived 
from polyoma, adenovirus, retrovirus or SV40. Additional preferred or required 
operational elements include, but are not limited to, leader sequence, termination codons, 
polyadenylation signals and any other sequences necessary or preferred for the appropriate 

5 transcription and subsequent translation of the nucleic acid sequence in the host system- It 
will be understood by one skilled in the art that the correct combination of required or 
preferred expression control elements will depend on the host system chosen. It will further 
be understood that the expression vector should contain additional elements necessary for 
the transfer and subsequent replication of the expression vector containing the nucleic acid 

10 sequence in the host system. Examples of such elements include, but are not limited to, 
origins of replication and selectable markers. It will further be understood by one skilled in 
the art that such vectors are easily constructed using conventional methods ("DNA 
Isolation and Sequencing", Bruce A. Roe, Judy S. Crabtree and Akbar S. Khan, Published by 
John Wiley & Sons, 1996) or are commercially available. 

15 Another aspect of this invention relates to a host organism or a host cell into which a 

recombinant nucleic acid construct comprising the nucleic acid molecules as described 
above, operably linked to an expression vector, has been inserted. The host cells 
transformed with the nucleic acid constructs encompassed by this invention include 
eukaryotes, such as animal, plant, insect and yeast cells and prokaryotes, such as E. coli. 

20 The means by which the nucleic acid construct carrying the nucleic acid sequence may be 
introduced into the cell include, but are not limited to, microinjection, electroporation, 
transduction, or transfection using DEAE-dextran, lipofection, calcium phosphate or other 
procedures known to one skilled in the art (Sambrook et aL (1989) in "Molecular Cloning. 
A Laboratory Manual", Cold Spring Harbor Press, Plainview, New York). 

25 In a preferred embodiment, eukaryotic expression vectors that function in eukaryotic 

cells are used. Examples of such vectors include, but are not limited to, retroviral vectors, 
vaccinia virus vectors, adenovirus vectors, herpes virus vector, fowl pox virus vector, 
plasmids, or the baculovirus transfer vectors. Preferred eukaryotic cell lines include, but are 
not limited to, COS cells, CHO cells, HeLa cells, NIH/3T3 cells, 293 cells (ATCC# 

30 CRL15731), T2 cells, dendritic cells, monocytes or Epstein-15 Barr Virus transformed B 
cells. 

An antigenic peptide of the invention can be obtained, for example, by extraction 
from a natural source (e.g., elution from MHC II molecules); by expression of a 
recombinant nucleic acid encoding the peptide; or by chemical synthesis. A peptide that is 
35 produced in a cellular system different from the source from which it naturally originates is 
"isolated," because it will be separated from components which naturally accompany it. The 
recombinant peptide expressed by a host organism can be obtained as a crude lysate or can 
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be purified by standard protein purification procedures known in the art which may- 
include differential precipitation, size exclusion chromatography, ion-exchange 
chromatography, isoelectric focusing, gel electrophoresis, affinity, and immunoafifinity 
chromatography and the like. The extent of isolation or purity can be measured by any 
5 appropriate method, e.g. mass spectrometry or HPLC analysis. The peptides may be 
prepared synthetically by procedures described in Merrifield, (1986) Science 232: 341-347, 
and Barany andMerrifield, The Peptides, Gross and Meienhofer, eds (N. Y., Academic 
Press), pp. 1-284 (1979). The synthesis can be carried out in solution or in solid phase or 
with an automatized synthesizer (Stewart and Young, Solid Phase Peptide Synthesis, 2nd 
10 ed., Rockford HI., Pierce Chemical Co. (1984)). 

Therefore, the present invention further provides a method for producing a MHC 
class II antigenic peptide comprising (a) at least the amino acid sequence of the peptide 
binding motif selected from the group consisting of SEQ ID NOs. 49 to 57 and SEQ ID 
NOs. 103 to 122, or (b) at least the amino acid sequence of the peptide binding motif 

15 selected from the group consisting of SEQ ID NOs. 49 to 57 and SEQ ID NOs. 103 to 122, 
with additional N-and C- terminal flanking sequences of a .corresponding sequence selected 
from the group consisting of SEQ ID NOs. 1 to 39 and SEQ ID NOs. 58 to 102, comprising 
the steps of culturing the host cell containing a recombinant nucleic acid construct as 
described above under conditions allowing expression of said peptide and recovering the 

20 peptide from the cells or the culture medium. 



In a further embodiment of the present invention, a method is provided for isolating 
and identifying MHC class II associated RA antigenic peptides in femtomolar amounts, 
which method comprises (a) providing immature dendritic cells in a number comprising 

25 0.1 to 5 jig MHC class II molecules; (b) contacting the cells of (a) with serum or synovial 
fluid and inducing maturation of dendritic cells by adding TNFalpha; (c) isolating class II 
MHC molecule-antigenic peptide complexes from the cells with methods comprising 
solubilization of the cells and sequestration of the complexes of MHC class II molecules 
with antigenic peptides by immunoprecipitation or immunoafifinity chromatography; (d) 

30 washing the sequestered complexes of MHC class II molecules with antigenic peptides with 
water in an ultrafiltration tube; (e) eluting the associated antigenic peptides from the MHC 
class II molecules at 37°C with diluted trifluoro acetic acid, and (f) separating, detecting 
and identifying the isolated peptides by liquid chromatography and mass spectrometry. 
Furthermore, in step (f) of the method, the liquid chromatography comprises a first linear 

35 elution step from the reversed-phase material with a volume sufficient to elute the majority 
of contaminants prior to peptide elution. Moreover, the method may further comprise (g) 
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analyzing the identified peptides by methods comprising a database and a software 
developed to perform comparative data analysis across multiple datasets. 

The amount of tissue or bodily fluid necessary to obtain e.g. 100 ng MHC class II 
molecules depends on the number of cells that do express MHC class II and on the 
5 expression rate of MHC class II molecules: e.g. 100 ng of MHC class II are equivalent to 
about 2 x 10 5 mature DCs or 5 to 10 x 10 6 peripheral blood monocytes or about 5 x 10 7 
peripheral blood mononuclear cells which can be obtained from about 50 ml of blood. 

For the purification of class II MHC molecule-antigenic peptide complexes from cells 
or tissue, the membranes of the cells or tissue have to be solubilized. Cell lysis may be 

10 carried out with methods known in the art, e.g. freeze- and- thaw cycles and the use of 
detergents, and combinations thereof. Preferred lysis methods are solubilization using 
detergents, preferably TX- 100, NP40, n-octylglucoside, Zwittergent, Lubrol, CHAPS, most 
preferably XX- 100 or Zwittergent 3-12. Cell debris and nuclei have to be removed from cell 
lysates containing the solubilized receptor-peptide complexes by centrifugation. Therefore, 

15 the complexes of class II MHC molecules with antigenic peptides are isolated from the cells 
with methods comprising solubilization with a detergent 

Furthermore, the MHC class II molecule-peptide complexes are purified from cell 
lysates by methods comprising immunoprecipitation or immunoaffinity chromatography. 
For the immunoprecipitation or immunoaffinity chromatography, antibodies specific for 
20 MHC class II molecules and suitable for these methods are used. The specific antibodies are 
preferably monoclonal antibodies, and are covalently or non-covalently e.g. via Protein A, 
coupled to beads, e.g. sepharose or agarose beads. A selection of the broad panel of anti- 
HLA antibodies used in the prior art comprises: anti-HLA-DR antibodies: L243, TU36, 
DA6.147, preferably L243; anti-HLA-DQ antibodies: SPVL3, TU22, TU169, preferably 
"~ 25 TU22 and TU169; anti-HLA-DP antibody B7/21 and anti-HLA-A,B,C antibodies W6/32 
and B9.12. 

Monoclonal antibodies specific for different MHC class II molecules may be 
commercially obtained (e.g. Pharmingen, Dianova) or purified from the supernatant of the 
respective hybridoma cells using Protein A- or Protein G- affinity chromatography. 
30 Purified monoclonal antibodies may be coupled by various methods known in the art, 
preferably by covalently coupling antibody amino groups to CNBr-activated sepharose. 

Immunoisolation of MHC molecules may be performed by incubating the antibody- 
beads with the cell lysate under rotation for several hours or chromatographically by 
pumping the cell lysate through a micro-column. Washing of the antibody-beads may be 
35 performed in eppendorf tubes or in the microcolumn. The efficacy of the 
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immunoprecipitation may be analysed by SDS-PAGE and western blotting using 
antibodies recognizing denatured MHC molecules (anti-HLA-DRalpha: 1B5; anti-HLA 
class I: HC10 or HCA2). 

The sequestered MHC class II molecule-peptide complexes are washed with water or 
5 low-salt buffer before elution in order to remove residual detergent contaminants. The low 
salt buffer may be a Tris, phosphate or acetate buffer in a concentration range of 0.5 - 10 
mM, preferably in a concentration of 0.5 mM. In a more preferred embodiment, the MHC 
class II molecule -peptide complexes are washed with ultrapure water (sequencing grade) 
conventionally used for HPLC analysis, preferably with ultrapure (sequencing grade) water 

10 from MERCK. The washing step may be carried out by ultrafiltration. The ultrafiltration 
may be carried out in an ultrafiltration tube with a cut-off of 30 kD, 20 kD, 10 kD or 5 kD, 
preferably of 30 kD and a tube volume of 0.5 - 1.0 ml ("Ultrafree" tubes; Millipore). The 
washing in the ultrafiltration tube may be carried out 4 to 12 times, preferably 6 to 10 
times, with a volume of 10 to 20 times the volume of the beads carrying the receptor- 

15 peptide complexes, preferably with a volume of 15 times the beads. The eluted peptides 
maybe separated from the remaining MHC class II molecules using the same.ultrafiltration 
tube. The eluted peptides may then be lyophilized. 

By eluting the peptides from the MHC class II molecules, a complex mixture of 
naturally processed peptides derived from the source of potential antigen and from 
20 polypeptides of intra- or extracellular origin, is obtained. Only after elution, peptides can 
be separated and subjected to sequence analysis. 

The antigenic peptides in the method of the present invention may be eluted by a 
variety of methods known in the art, preferably by using diluted acid, e.g., diluted 
acetonitrile (Jardetzky TS et al., Nature 1991 353, 326-329), diluted acetic acid and heating 
25 (Rudensky AY et al., Nature 1991, 353, 622-626; Chicz RM et al., Nature 1992, 358, 764- 
768) or diluted trifluoro acetic acid at 37°C (Kropshofer H et al., J Exp Med 1992, 175, 
1799-1803). Most preferably, the peptides are eluted at 37°C with diluted trifluoro acetic 
acid. 

The isolated antigenic peptides are then separated, detected and identified. By 
30 detecting it is understood that the amino acid sequence of the individual peptides in the 
mixture of isolated antigenic peptides is elucidated by methods adequate to detect and 
sequence femtomolar amounts of peptides. By identifying it is understood that it is 
established from which proteins or polypeptides the antigenic peptides are derived and 
which sequence they constitute within these proteins or polypeptides. 
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In a first step, the complex mixture of eluted peptides may be separated by one of a 
variety of possible chromatographic methods, e.g. by reversed phase, anion exchange, 
cation exchange chromatography or a combination thereof. Preferably, the separation is 
performed by C18-reverse phase chromatography or by reversed-phase / cation exchange 
5 two-dimensional HPLC, denoted as MudPit (Washburn MP et al, Nat BiotechnoL, (2001), 
19, 242-247). 

The separation is done in a HPLC mode utilizing fused-silica micro-capillary 
columns which are either connected to a nano-flow electrospray source of a mass 
spectrometer or to a micro -fractionation device which spots the fractions onto a plate for 
10 MALDI analysis. 

Liquid chromatography comprises peptide fractionation by the use of a strong ion 
exchange material and a hydrophobic reversed-phase material. For the elution of the 
peptides from the ion exchange and reversed-phase material different elution programs are 
run one after another comprising elutions with salt and with organic solvents, e.g., 

15 acetonitrile. The elution from the reversed-phase material is conducted in several steps of 
linear gradients of different lengths and slopes. A contamination in the sample to be 
fractionated may be any contamination whose elution competes with the detection of the 
peptide peaks in the mass spectrometer. Therefore, in order to prevent simultaneous 
elution, the contaminants have to be eluted with a sufficient solvent volume prior to the 

20 peptide elution step. Depending on the column used for liquid chromatography the solvent 
volume sufficient to elute the contaminants prior to the peptide elution step may be 100 to 
200 times the column volume. 

A variety of mass spectrometric techniques are suitable, preferably MALDI-post 
source decay (PSD) MS or electrospray ionization tandem mass spectrometry (ESI-MS), 
25 most preferably ion-trap ESI-MS. 

The sequences of the individual peptides can be determined by means known in the 
art. Preferably, sequence analysis is performed by fragmentation of the peptides and 
computer-assisted interpretation of the fragment spectra using algorithms, e.g. MASCOT 
or SEQUEST. Both computer algorithms use protein and nucleotide sequence databases to 
30 perform cross-correlation analyses of experimental and theoretically generated tandem 
mass spectra. This allows automated high through-put sequence analysis. 



The isolated and identified antigenic peptides of the invention can be validated by the 
MHC binding motif, the MHC binding capacity and/or by T cell recognition. 
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MHC binding motif 

Peptides associated to a particular MHC molecule (allelic variant) have common 
structural characteristics, denoted as binding motifs, necessary to form stable complexes 
with MHC molecules. Peptide ligands eluted from MHC class I molecules are relatively 
5 short, ranging from 8-11 amino acids. Moreover, 2 or 3 side chains of the peptide are 
relevant for binding. The position of the respective amino acid side chains varies with the 
HLA allele, most often two of these so-called "anchor" residues are located at positions 2 
and 9. With respect to a particular anchor position, only 1 or 2 amino acids normally can 
function as anchor amino acids e.g. leucine or valine V at position 2 in the case of HLA-A2. 

10 In the case of MHC class II molecules, the peptide length varies from 11 to 25 amino 

acids, as longer peptides can bind since both ends of the peptide binding groove are open. 
Most HLA class II molecules accommodate up to 4 anchor residues at relative positions PI, 
P4, P6 and P9 contained in a nonameric core region. This core region, however, can have 
variable distance from the N-terminus of the peptide. In the majority of cases, 2-4 N- 

15 terminal residues precede the core region. Hence, the PI anchor residues is located at 
positions 3, 4 or 5 in most HLA class II associated peptides. Peptides eluted from HLA-DR 
class II molecules share a big hydrophobic PI anchor, represented by tyrosine, 
phenylalanine, tryptophane, methionine, leucine, isoleucine or valine. 

The position and the exact type of anchor residues constitute the peptide binding 
20 motif which is known for most of the frequently occurring HLA class II allelic products. A 
computer algorithm allowing motif validation in peptide sequences is "Tepitope", available 
by vaccinome. 

MHC binding capacity 

Peptides identified by the method of the invention may be tested for their ability to 
25 bind to the appropriate MHC class II molecule by methods known in the art using, for 
example, isolated MHC class II molecules and synthetic peptides with amino acid 
sequences identical to those identified by the method of the invention (Kropshofer H et al., 
J. Exp. Med. 1992; 175, 1799-1803; Vogt AB et al., J. Immunol. 1994; 153, 1665-1673; Sloan 
VS et al., Nature 1995; 375, 802-806). Alternatively, a cellular binding assay using MHC 
30 class II expressing cell lines and biotinylated peptides can be used to verify the identified 
epitope (Axndt SO et al., EMBO J., 2000; 19, 1241-1251) 

In both assays, the relative binding capacity of a peptide is measured by determining 
the concentration necessary to reduce binding of a labelled reporter peptide by 50%. This 
value is called IC50. Peptide binding with a reasonable affinity to the relevant HLA class II 
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molecules attain IC50 values not exceeding 10-fold the IC50 of established reference 
peptides. 

The same binding assays can also be used to test the ability of peptides to bind to 
alternative class II MHC molecules, i.e., class II MHC molecules other than those from 
5 which they were eluted using the method of the invention. The diagnostic methods of the 
invention using such peptides and therapeutic methods of the invention, using either the 
peptides or peptides derived from them, can be applied to subjects expressing such 
alternative class II MHC molecules. 

T cell recognition 

10 The epitope verification procedure may involve testing of peptides identified by the 

method of the invention for their ability to activate CD4+ T cell populations. Peptides with 
amino acid sequences either identical to those identified in the present invention or 
corresponding to a core sequence derived from a nested group of peptides identified in the 
present invention are synthesized. The synthetic peptides are then tested for their ability to 

15 activate CD4+ T cells from (a) test subjects expressing the MHC class. II molecule of 
interest and having at least one symptom of the disease; and (b) control subjects expressing 
the MHC class II molecule of interest and having no symptoms of the disease. Additional 
control subjects can be those with symptoms of the disease and not expressing the MHC 
class II molecule of interest. 

20 In some diseases (e.g., those with an autoimmune component) responsiveness in the 

CD4+ T cells of test subjects but not in CD4+ T cells of the control subjects described in 
(b) provides confirmatory evidence that the relevant peptide is an epitope that activates 
CD4+ T cells that can initiate, promote, or exacerbate the relevant disease. In other diseases 
(e.g., cancer or infectious diseases without an autoimmune component), a similar pattern 

25 of responsiveness and non-responsiveness to that described in the previous sentence would 
indicate that the relevant peptide is an epitope that activates CD4+ T cells that can mediate 
immunity to the disease or, at least, a decrease in the symptoms of the disease. 

CD4+ T cell responses can be measured by a variety of in vitro methods known in the 
art. For example, whole peripheral blood mononuclear cells (PBMC) can be cultured with 

30 and without a candidate synthetic peptide and their proliferative responses measured by, 
e.g., incorporation of [ 3 H] -thymidine into their DNA. That the proliferating T cells are 
CD4+ T cells can be tested by either ehminating CD4+ T cells from the PBMC prior to 
assay or by adding inhibitory antibodies that bind to the CD4+ molecule on the T cells, 
thereby inhibiting proliferation of the latter. In both cases, the proliferative response will be 

35 inhibited only if CD4+ T cells are the proliferating cells. Alternatively, CD4+ T cells can be 
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purified from PBMC and tested for proliferative responses to the peptides in the presence 
of APC expressing the appropriate MHC class II molecule. Such APC can be B- 
lymphocytes, monocytes, macrophages, or dendritic cells, or whole PBMC. APC can also 
be immortalized cell lines derived from B-lymphocytes, monocytes, macrophages, or 
5 dendritic cells. The APC can endogenously express the MHC class II molecule of interest or 
they can express transfected polynucleotides encoding such molecules. In all cases the APC 
can, prior to the assay, be rendered non-proliferative by treatment with, e.g., ionizing 
radiation or mitomycin-C. 

As an alternative to measuring cell proliferation, cytokine production by the CD4+ T 
10 cells can be measured by procedures known to those in art. Cytokines include, without 
limitation, interleukin-2 (IL-2), interferon-gamma (IFN-gamma), interleukin-4 (IL-4), 
TNF-alpha, interleukin-6 (IL-6), interleukin-10 (IL-10), interleukin-12 (IL-12) or TGF- 
beta. Assays to measure them include, without limitation, ELISA, and bio-assays in which 
cells responsive to the relevant cytokine are tested for responsiveness (e.g., proliferation) in 
15 the presence of a test sample. 

Alternatively, cytokine production by CD4+ lymphocytes can be directly visualized 
by intracellular immunofluorescence staining and flow cytometry. 

Moreover, the MHC class II antigenic peptides of the present invention may be used 
20 in the diagnosis of RA. Therefore, a further embodiment of the invention is the use of an 
antigenic peptide according to the present invention as a marker for RA. 

Preferably, a MHC class II antigenic peptide comprising (a) at least the amino acid 
sequence of the peptide binding motif selected from the group consisting of SEQ ID NOs. 
49 to 57 and SEQ ID NOs. 103 to 122, or (b) at least the amino acid sequence of the 
25 peptide binding motif selected from the group consisting of SEQ ID NOs. 49 to 57 and 
SEQ ID NOs. 103 to 122, with additional N-and C-terminal flanking sequences of a 
corresponding sequence selected from the group consisting of SEQ ID NOs. 1 to 39 and 
SEQ ID NOs. 58 to 102, is used as a marker for RA. 

In another embodiment, the antigenic peptides of the invention may be used as 
30 response markers to track the efficacy of a therapeutic regime. Essentially, a baseline value 
for an antigenic peptide can be determined, then a given therapeutic agent is administered, 
and the levels of the antigenic peptide are monitored subsequently, whereas a change in the 
level of the antigenic peptide is indicative of the efficacy of a therapeutic treatment. 



WO 2005/014622 



29 



PCT/EP2004/008609 



Furthermore, the antigenic peptides which are only found in certain stages or phases 
of a disease, preferably of RA, may be utilized as stage-specific markers. Essentially, the 
levels of the antigenic peptides which have been linked to a certain disease stage are 
monitored regularly, thereby providing information about the stage of the disease and its 
5 progression. 

The invention also includes the use of the polypeptides the RA antigenic peptides are 
derived from as markers for the diagnosis and monitoring of a disease, preferably of RA, 
and in particular, of erosive versus non-erosive RA. The rationale for the use of the 
respective proteins is that DCs reside in most tissues where they capture exogenous 

10 antigens via specific receptors and via specialized endocytotic mechanisms (e.g. 
macropinocytosis) followed by presentation of the processed antigens as peptides on MHC 
class II molecules. Previous studies have shown that the frequency of a peptide epitope 
found in the context of MHC class II molecules, e.g. the RA antigenic peptides, in the 
majority of cases mirrors the abundance of the protein from which this particular peptide 

15 was derived from. Therefore, not only the RA antigenc peptides but also the corresponding 
proteins can serve as markers for RA. 

Therefore, in a further embodiment of the present invention, the use of a polypeptide 
selected from the group consisting of interferon-gamma-inducible lysosomal thiol 
reductase (SEQ ID NO: 40), integrin beta-2 (SEQ ID NO. 123), phosphatitylinositol-4,5- 

20 bisphosphate 3-kinase (SEQ ID NO: 124), urokinase-type plasminogen activator (SEQ ID 
NO. 125), immunoglobulin heavy chain V-III region (V H 26) (SEQ ID NO. 126), DJ-1 
protein (SEQ ID NO. 127), apolipoprotein B-100 (SEQ ID NO: 41), 26S proteasome non- 
ATPase regulatory subunit 8 (SEQ ID NO. 128), interleukin-1 receptor (SEQ ID NO: 129), 
fibromodulin (SEQ ID NO. 130), GM-CSF/IL-3/IL-5 receptor (SEQ ID NO. 131), sorting 

25 nexin 3 (SEQ ID NO. 132), inter-alpha-trypsin inhibitor heavy chain H4 (SEQ ID NO: 42), 
complement C4 (SEQ ID NO: 43), complement C3 (SEQ ID NO: 44), SH3 domain- 
binding glutamic acid-rich-like protein 3 (SEQ ID NO: 45), interleukin-4-induced protein 
1 (SEQ ID NO: 46), hemopexin (SEQ ID NO: 47), Hsc70-interacting protein (SEQ ID NO: 
48), invariant chain (Ii) (SEQ ID NO. 133), retinoic acid receptor responder protein 2 

30 (SEQ ID NO. 134), fibronectin (SEQ ID NO. 135), cathepsin B (SEQ ID NO: 136), 
tripeptidyl-peptidase II (SEQ ID NO. 137), legumain (SEQ ID NO. 138), platelet activating 
factor receptor (SEQ ID NO. 139), poly-alpha-2.8-sialyltransferase (SEQ ID NO. 140), ras- 
leated protein Rab-llB (SEQ ID NO. 141) as a marker for RA is provided. Preferably, the 
polypeptide is used as a marker for erosive RA. It is also preferred to use the polypetide as a 

35 marker for non-erosive RA. Especially preferred is the use of interleukin-4-induced protein 
1 (SEQ ID NO: 46) as a marker for RA. The Figl polypeptide has not been known as a 
marker for RA until now, and is considered as an important candidate marker for RA. 
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The diagnosis of RA can be made by examining expression and/or composition of a 
polypeptide or peptide marker for RA, by a variety of methods, including enzyme linked 
immunosorbent assays (ELISAs), Western blots, immunopretipitations and 
immunofluorescence. A test sample from an individual is assessed for the presence of an 

5 alteration in the expression and/or an alteration in composition of a polypeptide or a 
peptide of the present invention. An alteration in expression of a polypeptide or peptide 
can be, for example, an alteration in the quantitative polypeptide expression (i.e., the 
amount of polypeptide produced); an alteration in the composition of a polypeptide is an 
alteration in the qualitative polypeptide expression (e.g., expression of a mutant 

10 polypeptide or of a different splicing variant). 

Both such alterations (quantitative and qualitative) can also be present. An 
"alteration" in the polypeptide expression or composition, as used herein, refers to an 
alteration in expression or composition in a test sample, as compared with the expression 
or composition of the peptide or polypeptide in a control sample. A control sample is a 

15 sample that corresponds to the test sample (e.g., is from the same type of cells), and is from 
an individual who is not affected by RA. An alteration in the expression or composition of 
the peptide or polypeptide in the test sample, as compared with the control sample, is 
indicative of RA or a susceptibility to RA. Various means of examining expression or 
composition of a peptide or polypeptide of the present invention can be used, including 

20 spectroscopy, colorimetry, electrophoresis, isoelectric focusing, and immunoassays (e.#, 
David et al> U.S. Pat No. 4,376,110) such as immunoblotting (see also Current Protocols 
in Molecular Biology, particularly chapter 10). For example, in one embodiment, an 
antibody capable of binding to the polypeptide (e.g., as described above), preferably an 
antibody with a detectable label, can be used. Antibodies can be polyclonal, or more 

25 preferably, monoclonal. An intact antibody, or a fragment thereof (e.g. y Fab or F(ab')2) can 
be used. The term labeled", with regard to the probe or antibody, is intended to 
encompass direct labeling of the probe or antibody by coupling (i.e., physically linking) a 
detectable substance to the probe or antibody, as well as indirect labeling of the probe or 
antibody by reactivity with another reagent that is directly labeled. Examples of indirect 

30 labeling include detection of a primary antibody using a fluorescently labeled secondary 
antibody and end-labeling of a DNA probe with biotin such that it can be detected with 
fluorescently labeled streptavidin. 

Western blotting analysis, using an antibody as described above that specifically binds 
to a peptide or polypeptide of the present invention, may be used to measure the level or 
35 amount of a peptide or polypeptide in a test sample and comparing it with the level or 
amount of the peptide or polypeptide in a control sample. Preferably the peptide or 
polypeptide in a test sample is measured in a homogenous or a heterogenous immuno 
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assay. A level or amount of the polypeptide in the test sample that is higher or lower than 
the level or amount of the polypeptide in the control sample, such that the difference is 
statistically significant, is indicative of an alteration in the expression of the polypeptide, 
and is diagnostic for a RA or a susceptibility to RA. 

5 Therefore, the present invention also relates to a diagnostic composition comprising 

an antibody reactive with a MHC class II antigenic peptide of the invention. 



In a further embodiment the antigenic peptides of the invention or the proteins they 
are derived from may be used in the prevention and treatment of a disease, preferably of 
10 RA. 

One aspect of the invention is a therapeutic purpose, wherein one or more of the 
identified antigenic peptides are used to vaccinate patients against RA, preferably against 
erosive and/or non-erosive RA. In the course of the vaccination the antigenic peptide 
would induce an antigen-specific T cell tolerance in the patient which would ultimately 
15 lead to regression of the disease or to an attenuation of disease development. 

A promising strategy to induce specific immune tolerance in future clinical trials is 
the use of DNA tolerizing vaccines. DNA tolerizing vaccines encoding autoantigens alone 
were shown to reduce T cell proliferative responses (Ruiz, P. et aL, J Immunol 162 (1999) 
3336-3341), while DNA tolerizing vaccines co-delivering autoantigen plus IL-4 also 

20 induced protective T H 2 responses (Garren, H. et aL, Immunity 15 (2001) 15-22). Examples 
of non-polynucleotide-specific tolerizing therapies under development include protein 
antigens, naturally processed peptides, altered peptide ligands, other biomolecules, such as 
DNA, or proteins and peptides containing posttranslational modifications, and antigens 
delivered orally to induce "oral tolerance" (reviewed in: Robinson, W.H. et aL, Clin 

25 Immunol 103 (2002) 7-12). A potential adverse effect with regard to tolerizing therapies is 
the development of autoimmunity. 

To this end, the relevant RA antigenic peptides may be directly administered to the 
patient in an amount sufficient for the peptides to bind to the MHC molecules, and 
provoke peripheral tolerance of T cells. 

30 Alternatively, the antigenic peptides of the invention may be utilized for the 

generation of vaccines based on DCs. In this case, autologous DCs derived from patients' 
monocytes may be pulsed with the relevant peptides or recombinant proteins containing 
the relevant peptide sequences. 



WO 2005/014622 



PCT/EP2004/008609 



32 

Therefore, the present invention provides a pharmaceutical composition comprising 
a MHC class II antigenic peptide comprising (a) at least the amino acid sequence of the 
peptide binding motif selected from the group consisting of SEQ ID NOs. 49 to 57 and 
SEQ ID NOs. 103 to 122, or (b) at least the amino acid sequence of the peptide binding 
5 motif selected from the group consisting of SEQ ID NOs. 49 to 57 and SEQ ID NOs. 103 to 
122, with additional N-and C-terminal flanking sequences of a corresponding sequence 
selected from the group consisting of SEQ ID NOs. 1 to 39 and SEQ ID NOs. 58 to 102, an 
antibody reactive with said antigenic peptide, or a polypeptide selected from the group 
consisting of SEQ ID NOs 40 to 48 and SEQ. ID NOs. 123 to 141, and optionally a 
10 pharmaceutically acceptable excipient, diluent or carrier. The antigenic peptide has to be 
present in an amount sufficient to tolerize specific lymphocytes. Such an amount will 
depend on the peptide used, the administration, the severity of the disease to be treated and 
the general conditions of the patient and will usually range from 1 to 50 mg/ml, for 
example in case of peptides being loaded on dendritic cells. 

15 An acceptable excipient, diluent or carrier may be phosphate buffered saline for in 
vitro studies and physiological salt solutions for in vivo applications. 

"Vaccination" herein means both active immunization, i. e. the in vivo administration 
of the peptides to elicit an in vivo immune tolerance directly in the patient and passive 
immunization, i. e. the use of the peptides to tolerize in vitro CD4+ T lymphocytes or to 
20 stimulate autologous or allogeneic dendritic cells, which are subsequendy re-inoculated 
into the patient 

The present invention also provides the antigenic peptides, antibodies, nucleic acids, 
host cells, methods, compositions and uses substantially as herein before described 
especially with reference to the Examples. 

25 Having now generally described this invention, the same will become better 

understood by reference to the specific examples, which are included herein for purpose of 
illustration only and are not intended to be limiting unless otherwise specified, in 
connection with the following figures. 



WO 2005/014622 



33 



PCT/EP2004/008609 



Examples 

The examples below are illustrated in connection with the figures described above 
and based on the methodology summarized in Fig 1, as described in the following. 
Commercially available reagents referred to in the examples, were used according to 
5 manufacturer's instructions unless otherwise indicated. 



Methodology of the invention 
Dendritic cells and culturing 

The study was performed with human dendritic cells which were differentiated from 
10 monocytes, as described below. Monocytes were purified from human peripheral blood. 
The blood was taken from healthy donors with the following haplotypes: (l)HLA- 
DRB1*0401, *03011 y (2) HLA-DRB1*0401, *0304, (3) HLA-DRB1*0401, *I30I, (4) HIA- 
DRB1*0401, *0701, HLA-DRB1*0401, *0407. 

All cells were cultured in RPMI 1640 medium (short: RPMI) supplemented with 1 
15 mM Pyruvate, 2 mM Glutamine and 10% heat-inactivated fetal calf serum (Gibco BRL, 
Rockville, MD). 

Isolation of peripheral blood mononuclear cells (PBMCs) 

Peripheral blood was obtained from the blood bank in Mannheim, Germany as . 
standard buffy coat preparations from healthy donors. Heparin (200 I.U./ml blood, 

20 Liquemine, Roche) was used to prevent clotting. Peripheral blood mononuclear cells 
(PBMCs) were isolated by centrifugation in LSM® (1.077-1.080 g/ml; ICN, Aurora, OH) at 
800g (room temperature) for 30 min. PBMCs were collected from the interphase and 
washed twice in RPMI containing 20 mM Hepes (500g for 15 min, 300g for 5 min). In 
order to remove erythrocytes, PBMCs were treated with ALT buffer (140 mM ammonium 

25 chloride, 20 mM Tris, pH 7.2) for 3 min at 37°C. PBMCs were washed twice with RPMI 
containing 20 mM Hepes (200g for 5 min). 

Generation of dendritic cells from peripheral blood monocytes. 

Monocytes were isolated from PBMCs by positive sorting using anti-CD 14 magnetic 
beads (Miltenyi Biotech, Auburn, CA) according to the manufacturer's protocol. 
30 Monocytes were cultured in RPMI supplemented with 1% non-essential amino acids 
(Gibco, BRL, Rockville, MD), 50 ng/ml recombinant human granulocyte macrophage- 
colony stimulating factor (GM-CSF; S.A. l.lxlO 7 U/mg) (Leucomax; Novartis, Basel 
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Switzerland) and 3 ng/ml recombinant human IL-4 (S.A. 2.9x10 4 U/(Xg) (RScD Systems, 
Minneapolis, MN). Monocytes were seeded at 0.3 x 10 6 /ml in 6-well plates (Costar) for 5 
days to obtain immature dendritic cells. 

The quality of monocyte-derived immature dendritic cells was routinely monitored 
5 by flow-cytometric analysis and assessed to be appropriate when they displayed the 
following phenotype: CDla (high), CD3 (neg.), CD14 (low), CD19 (neg.), CD56 (neg.)> 
CD80 (low), CD83 (neg.), CD86 (low) and HLA-DR (high). In contrast, mature dendritic 
cells (cf. below) display the following phenotype: CDla (low), CD80 (high), CD83 (high), 
CD86 (high) and HLA-DR (high). Monoclonal antibodies against CDla, CD3, CD14, 
10 CD19, CD56, CD80, CD83, CD86 as well as the respective isotype controls were purchased 
from Pharmingen (San Diego, CA). 

Exposure of dendritic cells to serum or synovial fluid 

Serum and synovial fluid were irradiated for 30 min with 137 Cs (70 TBq). To feed 
dendritic cells with serum- or synovia-derived antigen, 6 x 10 6 immature dendritic cells 
15 were pulsed with either 1 ml serum or 0.6 ml synovial fluid. At the same time maturation of 
dendritic cells was induced by adding 10 ng/ml recombinant human tumor necrosis factor 
alpha (TNFa; S.A. 1.1x10 s U/fig). As a control, 6 x 10 6 immature dendritic cells were 
incubated with TNFa alone. 

After 24 hrs in culture, mature dendritic cells were harvested by centrifugation at 
20 300g for 10 min. Cells were washed with PBS and transferred to an eppendorf tube. After 
centrifugation at 400g for 3 min, the supernatant was completely removed and the cells 
were frozen at -70°C. 

Generation of anti-HLA class II beads 

The anti-HLA-DR monoclonal antibody (mAb) L243 (ATCC, Manassas, VA) was 
25 produced by culturing the respective mouse hybridoma cell line. mAb L243 was purified 
using ProteinA sepharose (Pharmacia, Uppsala, Sweden) and immobilized to CNBr- 
activated sepharose beads (Pharmacia) at a final concentration of 2.5 mg/ml, according to 
the manufacturer's protocol. L243 beads were stored in PBS containing 0.1% Zwittergent 
3-12 (Calbiochem, La Jolla, CA). 
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Nano-scale purification of HLA-DR-peptide complexes 

Pellets of frozen dendritic cells were resuspended in 10-fold volume of ice cold lysis 
buffer (1% Triton-X-100, 20 mM Tris, pH 7*8, 5 mM MgCl 2 , containing protease 
inhibitors chymostatin, pepstatin, PMSF and leupeptin (Roche, Mannheim, Germany)) 

5 and lysed in a horizontal shaker at 1000 rpm, 4°C for 1 h. The cell lysate was cleared from 
cell debris and nuclei by centrifiigation at lOOOOg, 4°C for 10 min. The lysate was co- 
incubated with L243 beads (5-10 pi L243 beads per 100 |il cell lysate) in a horizontal shaker 
at 1000 rpm, 4°C for 2 hrs. Immunoprecipitated HLA-DR-peptide complexes bound to 
L243 beads were sedimented by centrifiigation at lOOOg, 4°C for 1 min and washed four 

10 times with 500 \A 0.1% Zwittergent 3-12 (Calbiochem) in PBS. 

The efficacy of depletion of HLA-DR-peptide complexes was monitored by analyzing 
the respective cell lysates before and after immunoprecipitation and aliquots of the beads 
by western blotting using the anti-HLA-DRa-specific mAb 1B5 (Adams, T.E. et al., 
Immunology 50 (1983) 613-624). 

15 Elution of HLA-DR-associated peptides 

HLA-DR-peptide complexes bound to L243 beads were resuspended in 100 |il H2O 
(HPLC-grade; Merck, Darmstadt, Germany), transferred to an ultrafiltration tube, 
Ultrafree MC, 30 kD cut-off (Millipore, Bedford, MA) and washed 10 times with 100 |il 
H 2 0 (HPLC-grade) by centrifiigation for 1-2 min at lOOOOg at RT. For eluting the bound 
20 peptides, 60 [A 0.1% trifluoracetic acid (Fluka, Buchs, Switzerland) in H 2 0 (HPLC-grade) 
was added and incubation was performed for 30 min at 37°C Eluted peptides were 
collected in a new eppendorf tube by centrigugation of the Ultrafree unit at lOOOOg for 3 
min at RT and immediately lyophilized in a Speed- Vac™ vacuum centrifuge. 

Fractionation by two-dimensional nanoflow LC 

25 To perform high-throughput sequencing of complex peptide mixtures, the MudPIT 

(multidimensional protein identification technology) was used (Washburn, M.P. et al., Nat 
Biotechnol 19 (2001), 242-247) which is based on liquid chromatographic fractionation 
followed by mass spectrometric sequence determination. 

To this end, lyophilized peptides eluted from HLA molecules were resuspended in a 
30 buffer containing 5% (v/v) acetonitrile (ACN), 0.5% (v/v) acetic acid, 0.012% (v/v) 
heptafluoro butyric acid (HFBA) and 1% (v/v) formic acid. The peptide mixture was 
fractionated on a fused-silica microcapillary column (100 fim i.d. X 375 [im) generated by a 
Model P-2000 laser puller (Sutter Instrument Co., Novato, CA). The microcolumn was 
packed with 3 ^im / C18 reversed-phase material (C18-ACE 3 Jim [ProntoSIL 120-3-C18 
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ACE-EPS, Leonberg, Germany]) followed by 3 cm of 5 \xm cation exchange material 
(Partisphere SCX; Whatman, Clifton, USA). 

A fully automated 8-step gradient separation on a LC Packings UltiMate HPLC (LC 
Packings, San Francisco, USA) was carried out, using the following buffers: 5% ACN / 
5 0.012% HFBA / 0.5% acetic acid (buffer A), 80% ACN / 0.012% HFBA / 0.5% acetic acid 
(buffer B), 250 mM ammonium acetate / 5% ACN / 0.012% HFBA / 0.5% acetic acid 
(buffer C), and 1.5 M ammonium acetate / 5% ACN / 0.012% HFBA / 0.5% acetic acid 
(buffer D). The first 116 min step consisted of a 75 miri gradient from 0 to 40% buffer B 
followed by a 10 min gradient from 40 to 80% buffer B, a 6 min hold at 80% buffer B and a 

10 10 min equilibration step with 100% buffer A. The next 5 steps (146 min each) were 
characterized by the following profile: 5 min 100% buffer A, 5 min gradient from 0 to x% 
buffer C, 5 min 100% buffer A, 30 min gradient from 0 to 10% buffer B, 55 min gradient 
from 10 to 35% buffer B, 20 min gradient from 35 to 50% buffer B, 10 min gradient from 
50 to 80% buffer B, a 6 min hold at 80% buffer B, and a 10 min equilibration step with 

15 100% buffer A. The buffer C percentages (x) in steps 2-6 were as follows: 20, 40, 60, 80, and 
90%. The 30 min gradient from 0 to 10% buffer B, which is the first linear elution step 
from the reversed-phase material, was needed in order to sufficiently separate peptide 
elution from the elution of a major contaminant (m/z=945) which otherwise would have 
led to the loss of the more hydrophilic peptide peaks. Step 7 consisted of the following 

20 profile: 5 min 100% buffer A, 20 min 100% buffer C, 5 min gradient from 0 to 10% buffer 
B, 35 min gradient from 10 to 35% buffer B, 50 min gradient from 35 to 50% buffer B, 
10 min gradient from 50 to 80% buffer B, a 5 min hold at 80% buffer B and a 10 min 
equilibration step with 100% buffer A. Step 8 was identical to step 7 with the exception of 
using buffer D instead of buffer C. 

25 Ion trap MS/MS mass spectrometry 

The HPLC column was directly coupled to a Finnigan LCQ Deca XP Plus ion trap 
mass spectrometer (Thermo Finnigan, San Jose, USA) equipped with a nano-LC 
electrospray ionization source. Mass spectrometry in the MS/MS mode was performed 
according to the manufacturer's protocol. Peptides were identified by the SEQUEST 
30 algorithm (U.S. patents 6,017,693 and 5,538,897). 
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MALDI-TOF mass spectrometry 

Peptides spotted onto an AnchorChip plate were co-cristallized with matrix 
(5 mg/ml; a-cyano-4-hycfroxy-cinnamic acid (Merck, Darmstadt, Germany), 50% 
acetonitrile, 0.1% trifluoroacetic acid). For qualitative analysis of the whole peptide 
5 repertoire, samples were analyzed on an Ultraflex™ MALDI-TOF mass spectrometer 
(Bruker, Bremen, Germany), according to the manufacturer's protocol. 

Sequence identification by SEQUEST and differential dataset analysis. 

MS/MS fragmentation data were analyzed with the software SEQUEST (Thermo 
Finnigan, San Jose, USA). From an in-house protein database, which was created based on 
10 the public databases Swiss-Prot and TrEMBL, SEQUEST extracted for each spectrum all 
peptide sequences that corresponded to the molecular mass of the parent ion and measured 
the degree of similarity between the experimental spectrum and the theoretical, in silico 
generated, spectrum. Only the top-scoring candidate sequence was listed. 

The peptide sequences derived from the SEQUEST analysis and their accompanying 
15 information on mass accuracy, scoring parameters and peptide origin were stored in an 
appropriately designed relational database and further processed. Certain constraints were 
enforced in order to guarantee the storage of only significant sequences with satisfying 
SEQUEST scores. The two most important constraints were: (i) keep only those sequences 
that have a cross correlation coefficient (CC) higher than a certain value and (ii) from the 
20 remaining sequences keep those ones which have a predefined delta cross correlation 
coefficient (ACC). For both criteria the minimum chosen values are based on empirical 
knowledge of interpreting SEQUEST results. 

A dataset was defined as the sum of data from a particular set of spectra. The design 
of database and software allowed queries on a single dataset as well as comparisons of 

25 multiple datasets. Such a database and software design enables comparative sample 
analysis, which is not provided by SEQUEST. For instance, possible queries on a single 
dataset could provide information on the score distribution among the stored spectra, on 
the existence of further sequence length variants or common subsequences, or on the 
protein origin of peptide sequences. Since the occurrence of truncation variants of the same 

30 epitope is a general characteristic of class II MHObound peptides, the existence of length 
variants in a dataset provides additional strong evidence for the presence of an epitope in a 
set of spectra. 

The most important feature in the analysis of multiple datasets is the possibility to 
extract a common subset of sequences that satisfies a given criterion. Such a criterion could 
35 be based on sequence similarity, e.g., within all sequences of a collection of datasets, those 
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sequences were selected that had at least one subsequence in common with any other 
sequence. Such comparisons across different datasets constitute the differential approach 
(RA samples versus control samples) and thereby optimize the search for candidate RA 
marker peptides. 

5 The pairwise similarity scores between sequences were calculated by a software 

routine, which is an implementation of a standard string-comparison algorithm. 
Subsequently, these scores were used to group closely-related sequences (sequences sharing 
a common subsequence) in well-separated clusters by an additionally developed software 
routine, which is based on a well-established algorithm (hierarchical clustering, UPGMA). 

10 The generated clusters (e.g. of peptide truncation variants) were then used, to identify 

closely-related sequences across different datasets. 

Overall, the data evaluation software provided the ability to perform swiftly and 
reproducibly the following: 

-Select from the sequence output generated by SEQUEST those sequences that satisfy 
15 reliable empirical criteria. 

- Store the data in a database appropriately designed for the discovery process at 

hand. 

-Extract information about the sequence content of each stored dataset. This 
information is valuable in assessing the importance of individual sequences within the 
20 given dataset and, consequently, across multiple datasets. 

-Provide, by virtue of the multiple dataset comparisons, a tool that realizes the 
differential approach, namely the study of the actual sequence content of one sample versus 
other(s). 



25 Purification of HLA-DR molecules 

HLA-DR molecules were purified from 10 10 EBV-transformed B cell lines or T2- 
transfectants by affinity chromatography, using anti-DR monoclonal antibody L243, as 
previously described (Kropshofer H. et al., PNAS 92 (1995) 8313-8317). 



30 



In vitro peptide binding assay 
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HA(307-319), PKYVKQNTLKLAT, is an immunodominant epitope from influenza 
virus hemagglutinin that binds well to HLA-DR4 molecules and was used as a reporter 
peptide in an in vitro peptide binding assay (Rothbard, J.B. et al., Cell (1988) 52:515-523). 

Purified detergent-solubilized HLA-DR4 molecules (200 nM) were co-incubated 
with biotinylated HA(307-319) peptide (200 nM) and graded amounts of competitor 
peptide (100 nM - 10 fjM) for 24 hrs at 37°C in binding buffer (50 nM sodium phosphate, 
50 mM sodium citrate, pH 4.8, 0.1% Zwittergent 3-12) in a total volume of 50 fA. The 
competitor peptides were derived from the candidate RA antigens identified in this study 
and purchased as synthetic peptides from Medprobe (Lund, Sweden). 

3 x 10 were then diluted 10-fold in PBS containing 0.05% Tween-20 and 1% BSA 
and incubated for 2 hrs in a microtiterplate (Nalge Nunc), which has been coated over 
night with anti-DR monoclonal antibody L243. Afterwards the samples were developed by 
incubation with 0.1 //g/ml EU-labeled streptavidin (Wallay Oy, Turku, Finland) for 45 min 
according to the manufacturer's protocol. After intensive washing with 0.05% Tween-20 in 
PBS, Europium fluorescence was measured with a time-resolved fluorometer (VICTOR 
1420, Wallac/Perkin Elmer Life Sciences) to quantify the binding of biotinylated HA(307- 
319) peptide to HLA-DR4 molecules (Arndt, S.O. et al., EMBO J. 19 (2000) 1241-1251). 
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Example 1 

In this example, the technique mentioned in Figure 1 was used to identify novel 
HLA-DR-associated peptide markers derived from serum and synovial fluid of patients 
with non-erosive RA. 

5 6 x 10 6 immature dendritic cells were pulsed with either 1 ml serum (5 samples) or 

0.6 ml synovial fluid (2 samples) of patients with non-erosive RA and cultured for 24 hrs in 
the presence of 10 ng/ml TNFoc. As a control, 6 x 10 6 dendritic cells were cultured in the 
presence of TNFoc (10 ng/ml) without adding serum but 1 ml of PBS. In an additional 
experiment 6 x 10 6 dendritic cells were pulsed with 1 ml serum from 2 healthy test persons 

10 and cultured for 24 hrs in the presence of TNFoc (10 ng/ml). 

Dendritic cells were lysed in detergent TX-100 and HLA-DR molecules were isolated 
using mAb L243. HLA-DR-associated peptides were eluted with 0.1% TFA and analyzed by 
high-throughput 2D-LC-MS/MS technology. Peptide identification was achieved by using 
the SEQUEST algorithm. The peptide sequences derived from the SEQUEST analysis and 
15 accompanying information on mass accuracy, scoring parameters and peptide origin were 
stored in a database and further processed. 

The peptide sequences identified from impulsed DCs (control 1) and from DCs 
pulsed with the serum of healthy test persons (control 2) were compared with the peptide 
sequences identified from DCs pulsed with the serum of non-erosive RA patients. Among 
20 the RA-specific sequences, only those peptides were selected for further evaluation that re- 
occurred in at least three of five non-erosive RA samples. 

In each serum sample roughly 600±150 individual peptide sequences (cross 
correlation coefficient CC > 3.0 and ACC > 0.15) were identified. In the synovia samples 
the number of individual peptide sequences was slightly smaller (400±30). Approximately 

25 80-85% of the peptides found in RA samples were also identified in control samples, 
underlining the high reproducibility of the analysis. In the majority of cases, several length 
variants of the same epitope could be identified which is a typical characteristic of class II 
MHC-bound antigens and supports the validity of the results (Jones, E.Y., Curr Opin 
Immunol 9 (1997) 75-79). Further confidence in the quality of the data relies on the fact 

30 that several of the identified peptides or proteins have already been described in the context 
of MHC class II molecules: epitopes derived from ubiquitous proteins like Hsp70, enolase, 
annexin II, cathepsin C or collagen II, as well as from MHC molecules (HLA-A, -B, -C, -E, 
-G, and p 2 -microglobulin) and CLIP (Chicz, R.M. et al., J Exp Med 178 (1993) 27-47; 
Sinigaglia, F. & Hammer, J., Curr Opin Immunol 6 (1994) 52-56; Arnold-Schild, D. et al., J 
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Immunol 162 (1999) 3757-3760; Vogt, A.B. & Kropshofer, H., Trends Biochem Sci 4 
(1999) 150-154) were frequently detected. 

RA-specific peptide sequences were further validated with regard to binding to the 
RA susceptibility allele D£B1 *0401 by using the TEPITOPE software (Hammer, J. et al., 
5 Adv Immunol 66 (1997) 67-100). This software provides means for the qualitative and 
quantitative prediction of T cell epitopes. 

The output of the study consists of an epitope that occurred, apart from one 
exception, only in non-erosive RA samples (Table 1). 



1 0 Interferon-gamma-inducible lysosomal thiol reductase 

A very interesting epitope which was identified in 3 out of 7 non-erosive RA samples 
from serum and synovia is derived from the interferon-gamma-inducible lysosomal thiol 
reductase (GILT): the 16-mer GILT (192-207) with the amino acid sequence of SEQ ID 
NO: 3 (Table 1). Further length variants in three other samples support the relevance of 
15 this epitope (Table 1): the 14-mer GILT (192-205; SEQ ID NO: 1) and the 17-mer GILT 
(192-208; SEQ ID NO: 2). 

As judged from the shortest length variant, GILT (192-205), the epitope contains a 
suitable binding motif, with regard to binding to the RA susceptibility allele DRB 1*0401: 
196M serves as a PI anchor, 199M as a P4 anchor and 201A as a P6 anchor. According to 
20 TEPITOPE scoring, the epitope has a binding score (threshold value) of 1% which is 
similar to the binding score of an epitope from influenza haemagglutinin (307-319) that 
was shown to be a strong DRB1*0401 binder (Table 1) (Rothbard, J.B. et al., Cell 52 (1988) 
515-523). 

GILT is constitutively expressed in antigen-presenting cells, such as dendritic cells, 
25 macrophages and B cells, and facilitates unfolding of endocytosed antigens in MHC class 
II-containing compartments (MIIC) by enzymatically reducing disulfide bonds (Phan, 
U.T. et al., J Biol Chem 275 (2000) 25907-25914). Direct binding of GILT to HLA-DR 
molecules has been reported for B cells (Arunachalam, B. et al., J Immunol 160 (1998) 
5797-5806). A rather long second epitope of GILT was found to bind to HLA-DR3 
30 molecules: the 22 -mer GILT (38-59) having the amino acid sequence 
SPLQALDFFGNGPPVNYKTGNL (Chicz, R.M. et al., J Exp Med 178 (1993) 27-47). 

In addition to GILT (192-207), another epitope of the same protein was identified in 
several RA samples, but also in control samples: GILT (210-227) with the amino acid 
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sequence QPPHEYWWVTVNGKPLE. This epitope was accompanied by 3 other length 
variants: the 16-mer GILT (210-225), the 17-mer GILT (210-226) and the 19-mer GILT 
(210-228). 

As indicated by its name, GILT expression can be induced by the pro-inflammatory 
5 cytokine interferon gamma (IFN-y) in various types of cells, including macrophages, 
endothelial cells and fibroblasts (Luster, A.D. et al., J Biol Chem 263 (1988) 12036-12043). 
As IFN-Y is known to be present in inflamed joints of RA patients, GILT could become 
over-expressed in synovia and serum and, hence, could be taken up by DCs as an 
exogenous antigen. GILT (192-207) maybe derived from exogenous GILT. The other GILT 
10 epitope, which is also present in the control samples, may be derived from endogenous 
GILT, expressed by DCs. Alternatively, both GILT (192-207) and GILT (210-227) maybe 
derived from endogenous GILT, in case that GILT processing and GILT-derived epitope 
presentation by DCs were critically altered upon contact with RA-associated material. 

The identified epitope of Interferon-gamma-inducible lysosomal thiol reductase 
15 GILT (192-205) which has been described already in detail, was further analysed in an 
in vitro binding assay using synthetic GILT (192-205) peptide and purified HLA-DR4 
molecules (Fig. 3): In agreement with TEPITOPE scoring, the peptide was shown to bind to 
HLA-DR4 with high affinity, comparable to the viral HA(307-319) peptide. 



20 Integrinbeta-2 

The extended analysis revealed the presence of an epitope which was identified in two 
out of seven non-erosive RA samples from serum and synovia and which is derived from 
the beta subunit of Integrin (ITB2): the 17-mer ITB2 (315-331; SEQ. ID NO: 58) with the 
amino acid sequence NIQPIFAVTSRMVKTYE (Table 1). One length variant was found: 
25 the 19-mer ITB2 (313-331; SEQ. ID NO: 59) supporting the validity of the identified 
epitope (Table 1). 

The peptide sequence contains a moderate HLA~DRB1*0401 binding motif with 3161 
and 3191 serving as PI and P4 anchor, respectively (TEPITOPE binding score: 2%). 

Integrins are a family of cell surface receptors that play important roles in 
30 embryogenesis, wound healing, immune responses, and cell adhesion. ITB2 is a 
heterodimeric receptor for intercellular and vascular adhesion molecules (ICAMs and 
VCAMs) and exclusively expressed on leukocytes. ICAMs and VCAMs are members of the 
immunoglobulin superfamily that has a central function in humoral and cell-mediated 
immune responses. Many cytokines, such as interferon-Y> IL-1 and TNFa, which are 
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upregulated in inflammatory diseases like RA, induce expression of ICAMs on the surface 
of endothelial cells. It was shown that the expression of ICAM-1 and VCAM-1 is higher in 
synovial tissue of RA patients as compared to osteoarthritis patients (Furuzawa-Carballeda, 
J. et aL, Scand J Immunol^O (1999) 215-222). 

5 Binding of ITB2 and other integrins to ICAMs allows leukocytes to infiltrate into 

their target tissues, e.g. the sites of inflammation. In order to function in a non-adherent or 
circulating mode, leukocytes constitutively express ITB2 and other integrins with low 
ligand-binding capacity. The ITB2/ICAM-1 interaction has become an important strategic 
target to approach inflammation, autoimmune diseases and cancer (Yusuf-Makagiansar, 

10 H. et aL, Med Res Rev 22 (2002) 146-167), hence these findings strongly support the 
validity of the identified ITB2 epitope as a candidate marker for RA. 



Phosphatidylinositol-4,5-bisphosphate3-kinase 

Another epitope with a moderate HLA-DRB1*0401 binding motif was identified in 
15 two non-erosive serum samples: the 17-mer PI3K (792-808; SEQ. ID NO: 60) with the 
amino acid sequence NKVFGEDSVGVIFKNGD derived from Phosphatidylinositol 3- 
kinase (PI3K ) (Table 1). In this peptide 794V could serve as a hydrophobic PI anchor, 
797E as a negatively charged P4 anchor, and 799S as a typical DR4-P6 anchor (TEPITOPE 
binding score: 3%). 

20 PI3K is ubiquitously expressed in many cell types and phosphoryiates lipids, 

predominantly phosphatidylinositol-4,5-bisphosphate. The protein has emerged as a key 
signal transducer for survival factor receptors, including growth factors, cytokines, and 
integrins (reviewed by Toker, A. & Cantley, L., Nature 387 (1997) 673-676). In addition, 
PI3K plays an important role in the signaling pathway of Toll-like receptors (TLRs) that 

25 recognize a variety of microbial products, collectively termed pathogen-associated 
molecular patterns (PAMPs) (Fukao, T. 8c Koyasu, S., Trends Immunol 24 (2003) 358- 
363). Stimulation through TLRs by PAMPs, such as lipopolysaccharide (LPS) (endotoxin) 
triggers production of various cytokines, including IL-12, which is a key cytokine in TLR- 
mediated Thl responses. It was shown that PI3K is an endogenous suppressor of TLR- 

30 mediated IL-12 production and limits excessive Thl polarization. Thus PI3K was suggested 
to be a negative regulator of innate immune responses in order to prevent prolonged 
activation of innate immunity harmful to the host. The knowledge that autoimmune 
responses in RA rely on Thl cytokines, ultimately links PI3K with RA. Although no specific 
pathogen has been consistently linked to the development of RA, it is believed that RA 

35 develops in two stages in which an initial response induced by foreign antigens, 
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subsequently develops into a self-sustaining autoimmune response (Klinman, D., Arthritis • 
Rheum 48 (2003) 590-593). 



Urokinase-type plasminogen activator 

5 Another epitope which was only found in non-erosive RA samples was derived from 

a urokinase-type plasminogen activator (uPA): the 16-mer uPA (328-343; SEQ. ID NO: 61) 
with the amino acid sequence YPEQLKMTVVKLISHR (Table 1). With regard to HLA- 
DRB1*0401 binding, the sequence reveals a moderate binding motif: 332L, 335T and 337V 
are putative PI, P4 and P6 anchors, respectively (TEPITOPE binding score: 2%). 

10 Plasminogen activators (PAs) are highly specific serine proteases generating plasmin 

from plasminogen. Two types of PAs, a urokinase-type (uPA) and a tissue-type (tPA) have 
been identified in mammals. The activities of PAs are controlled by natural inhibitors 
(PAIs). Plasmin is involved in inflammatory reactions by inducing cytokines such as TGFfi, 
and in cartilage and fibrin degradation. Evidences for ongoing coagulation, within the 

15 rheumatoid joint together with an enhancement of uPA synthesis and activation in 
arthritic joints have led to the hypothesis that the PA/plasmin system is associated with the 
clinical severity of arthritis (reviewed by Busso, N. 8c Hamilton, J.A., Arthritis Rheum 46 
(2002) 2268-2279). Based on in vitro studies, there are several cell types present in the 
inflamed joints of RA patients, in particular monocytes and synovial fibroblasts which can 

20 express PAs and PAIs and which therefore could contribute to the in vivo levels. The 
activity of PAs (i.e. uPA) is stimulated by cytokines, such as IL-1 and TNFa, which are 
known to be highly upregulated in serum as well as in synovial fluid of RA patients. These 
lines of evidence render uPA (328-343) a putative peptide marker for RA. 



25 Immimglobulin heavy chain V-III region (V H 26) 

One of the identified epitopes that display a very strong binding motif with regard to 
HLA-DRB1*0401 binding, is derived from the Vh26 gene segment of the immunglobulin 
heavy chain: the 16-mer V H 26 (95-110; SEQ. ID NO: 62) with the amino acid sequence 
KNTLYLQMNSLRAEDT (Table 1). The high TEPITOPE binding score (1%) is 
30 substantiated by 99Y functioning as a very potent PI anchor, 102M as a moderate P4 
anchor and 104S as a typical HLA-DR4 P6 anchor. 

Immunglobulins (Ig) are responsible for antigen binding and stimulate further 
immune reactions, e.g. by binding to isotope-specific Fc receptors. Interestingly, the 
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human V H 26 gene segment appears to encode a high-avidity synovial rheumatoid factor 
and thus seems to have an impact on RA progression (Wong, A. et al., Autoimmunity 20 
(1995) 191-199). The fact that the identified epitope V H 26 (95-110) was found in the serum 
of two seropositive RA patients (Table 1) correlates with the above observation. 

5 

DJ-1 protein 

In two non-erosive RA samples our continued studies revealed the presence of an 
epitope which is derived from a protein termed DJ-1: the 16-mer DJ-1 (135-150; SEQ. ID 
NO: 63) with the amino acid sequence NGGHYTYSENRVEKDG (Table l)/According to 
10 TEPITOPE scoring the peptide contains a rather weak HLA-DKB1 *0401 binding motif with 
139Y serving as a PI anchor, 142S as a P4 anchor and 144N as a possible P6 anchor 
(binding score: 8%). 

DJ-1 belongs to the ThiJ/PfpI protein family whose members are evolutionary 
distributed from Archaea to Eukarya. ThiJ/PfpI proteins share a conserved ThiJ domain 
15 that is structurally related to the type I glutamine amidotransferase domain (Lee, SJ. et al., 
J Biol Chem 25 (2003) Epub ahead of print). 

DJ-1, which is preferentially expressed in testis and moderately in other tissues, was 
first identified as a novel candidate of the oncogene product that transformed mouse 
NIH3T3 cells in cooperation with ras. In the mean time additional physiological roles of 
20 DJ-1 were revealed including a strong correlation to Parkinson disease and sperm 
fertilization. DJ-1 seems to have multiple functions - here we provide the first indication 
for an association of DJ-1 with RA. 



Example 2 

25 In this example, the same technology was used that has been described in detail in 

example 1. Serum (6 samples) and synovial fluid (2 samples) of patients with diagnosed 
erosive RA were utilized in this case to identify candidate markers specific for erosive RA. 

The peptide sequences found in the erosive RA samples were compared with the 
sequences identified in unpulsed DCs (control 1) and in DCs pulsed with the serum of 
30 healthy test persons (control 2). Among the RA-specific sequences, only those peptides 
were selected for further evaluation that re-occurred in at least three of six erosive RA 
samples. 
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In this study one epitope was discovered which occurred, apart from one exception, 
only in erosive RA samples. 



Apolipoprotein B-100 

5 The epitope which was mainly found in erosive RA sera (4 out of 8 erosive RA 

samples) is derived from apolipoprotein B-100: the 16-mer ApoB (2877-2892) with the 
amino acid sequence of SEQ ED NO: 4 (Table 2). In addition a length variant of the same 
epitope was identified (Table 2): the 17-mer ApoB (2877-2893; SEQ ID NO: 5). The 
following DRB1*0401 binding motif can be predicted: 2881L as a PI anchor, 2884D as a P4 
10 anchor and 2886N as a P6 anchor (binding score 3%). 

In an earlier study on EBV-B cells, the epitope ApoB (2885-2900), which pardy 
overlaps with the epitope described here, has been found in the context of HLA-DR4 
(Chicz, R.M. et al., J Exp Med 178 (1993) 27-47). 

Apolipoprotein B-100 is a constituent of very low-density lipoproteins (VLDL) and 
15 low-density lipoproteins (LDL) and functions as a recognition signal for the cellular 
binding and internalization of LDL particles by the ApoB/E receptor (Yang, C.Y. et al., 
Nature 323 (1986) 738-742). Interestingly, an increased ratio of LDL cholesterol to HDL 
cholesterol was observed among newly diagnosed RA patients (Park, Y.B. et al., J 
Rheumatol 26 (1999) 1701-1704). The adverse lipid profile in active RA could be improved 
20 by treating RA patients with DMARDs without the use of lipid-lowering agents (Park, Y.B. 
et al., Am J Med 113 (2002) 188-193). Since an increased cardiovascular mortality among 
patients with chronic inflammatory diseases, such as RA, is well documented (Symmons, 
D.P. et al., J Rheumatol 25 (1998) 1072-1077) it was suggested that local inflammation in 
RA leads to altered blood lipid levels, thereby increasing the risk of atherosclerosis. The 
25 question whether components of the lipoprotein metabolism are causal for pathogenesis or 
merely affected by ongoing immune reactions during RA development cannot be answered 
yet However, the observation of adverse lipid profiles in RA patients supports the validity 
of the presented ApoB epitope as a serum-derived RA candidate marker. 

The length variant ApoB (2877-2892), but not ApoB (2877-2893), has been identified 
30 in samples of two healthy controls (Table 2). Since Apolipoprotein B constitutes 1% of all 
plasma proteins, the presence of ApoB epitopes in healthy control samples is not 
surprising. The results suggest that only the length variant ApoB (2877-2893; SEQ ID NO: 
5) is specific for erosive RA. 



26S proteasome non- ATPase regulatory subunit 8 
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An epitope which was quite frequently identified in mostly erosive RA samples both 
from serum and synovia, is derived from the regulatory subunit 8 of the 26S proteasome 
(PSMD8): the 15-mer PSMD8 (218-232; SEQ. ID NO: 64) with the amino acid sequence 
GPNNYYSFASQQQKP (Table 2). Three additional length variants were identified: the 16- 
5 mer PSMD8 (218-233; SEQ. ID NO: 65), the 17-mer PSMD8 (218-234; SEQ. ID NO: 66) 
and the 18-mer PSMD8 (218-234; SEQ. ID NO: 67) (Table 2). The presence of length 
variants supports the validity of the identified epitope as a class II MHC-derived antigenic 
peptide. The peptide displays a moderate binding motif with respect to DRB1 *0401 binding 
(TEPITOPE binding score: 3%). Binding to HLA-DR4 could be confirmed in an in vitro 
10 binding assay using synthetic PSMD8 (218-233) peptide and purified HLA-DR4 molecules 
(Fig. 3): According to its IC 50 value against the reporter peptide HA(308-319), PSMD8 
(218-233; SEQ. ID NO: 65) binds to HLA-DR4 with a moderate affinity, confirming the 
TEPITOPE prediction. The 15-mer PSMD8 (218-232; SEQ. ID NO: 64) was identified once 
in an unpulsed control sample. 

15 The proteasome accouts for about 1% of the cytoplasmic protein pool and is 

involved in ATPrdependent degradation of ubiquitinated proteins. Moreover, the 
proteasome is responsible for the processing of several transcription factors (i.e. nuclear 
factor-KB), for cell cycle control and the generation of MHC class I restricted antigens. The 
regulatory subunits of the proteasome are important for the selectivity of protein 

20 degradation. The non-ATPase regulatory subunit 8, in particular, is known to be necessary 
for the activation of the cell division control protein 28 (Cdc28), an essential cell cycle 
regulator in yeast. 

Importantly, strongly increased levels of circulating proteasomes (cProteasomes) 
were detected in serum samples obtained from patients with different systemic 

25 autoimmune diseases, including RA (Egerer, K. et al., J Reumatol 29 (2002) 2045-2052). 
There appears to be a dose correlation between the levels of cProteasome and disease 
activity and concentrations of C-reactive protein in patients with severe RA cProteasome is 
discussed to trigger subsequent immune responses, which would indicate an antigen- 
driven mechanism. The concentration of released Proteasome antigen seems to reflect the 

30 magnitude of cellular damage in autoimmune diseases. From their findings Egerer and 
coworkers concluded that cProteasomes could represent a novel marker for disease severity 
in autoimmune processes. Since the epitope PSMD8 (218-232; SEQ. ID NO: 64) was 
mainly identified in erosive RA samples our analysis supports this conclusion. 



35 Interleukin-1 receptor 
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Another candidate marker for erosive RA is the Interleukin-1 receptor (IL-1R) which 
was identified in two erosive serum samples .via its peptide IL-1R (79-94; SEQ. ID NO: 68) 
with the amino acid sequence EKLWFVPAKVEDSGHY (Table 2). IL-1R (79-94; SEQ. ID 
NO: 68), with 83F as a PI anchor, 86A as a P4 anchor and 88V as a P6 anchor, contains a 
5 strong binding motif with regard to binding to the RA susceptibility allele DRB1*0401 
(TEPITOPE binding score: 1%). In an in vitro binding assay the synthetic peptide was 
shown to bind to HLA-DR4 molecules with high affinity (Fig. 3), similar to HA(309-319) 
peptide. 

Interleukin-1 (IL-1) is a proinflammatory cytokine and implicated in a variety of 
10 infectious immune responses as well as in RA and other inflammatory diseases (Dinarello, 
C, Blood 87 (1996) 2095-2147). It binds to its respective receptor which functions as a 
signal transducer to trigger cell proliferation or to stimulate protein synthesis. Increased IL- 
1 production has been observed in RA patients and plays a pivotal role in its clinical 
manifestations (Dayer, J.M., Rheumatology 42 (2003) ii3-iil9). IL-1 is viewed as a key 
15 mediator in RA through activation of macrophages and T- and B- lymphocytes. In 
addition, IL-1 contributes to inflammation by inducing the expression of cell-adhesion 
molecules, other cytokines and chemokines. Furthermore IL-1 is also pivotal in the 
destruction of bone and cartilage in RA by stimulating the production of matrix 
metalloproteinases. Thus, the IL-1/IL-1R complex is a first class target for anti- 
20 inflammatory therapeutics. The use of a recombinant human IL-1 receptor antagonist (IL- 
1RA) has been approved for the treatment of RA patients. 



Fibromodulin 

Three erosive RA samples gave rise to an epitope derived from the secreted matrix 
25 protein fibromodulin (FM): the 13-mer FM (178-190; SEQ. ID NO: 70) with the amino 
acid sequence LRELHLDHNQISR (Table 2). Furthermore, the 14-mer FM (177-190; SEQ. 
ID NO: 69) was identified, which was also found once in an impulsed control experiment. 
The epitope depicts a strong DRB 1*0401 binding motif with 181L serving as a PI anchor, 
184D as a P4 anchor and 186N as a P6 anchor (TEPITOPE binding score: 1%). 

30 Fibromodulin belongs to a family of small leucine-rich proteoglycans (SLRPs) that 

bind to TGF0S and collagens and other extracellular matrix molecules. In vitro, SLRPs were 
shown to regulate collagen fibrillogenesis, a process essential in development, tissue repair 
and metastasis. To better understand the function of SLRPs in vivo y SLRP-deficient mice 
were generated and shown to develop a wide array of diseases (e.g. osteoporosis and 

35 osteoarthritis), most of them resulting primarily from an abnormal collagen fibrillogenesis 



WO 2005/014622 



49 



PCT/EP2004/008609 



(Ameye, L. & Young, M.F., Glycobiology 12 (2002) 107R-116R). Since collagen formation 
and degradation is highly enhanced in the inflamed joints of RA patients, an increased level 
of fibromodulin can be envisaged. In accord with this consideration is the fact that the 
identified FM epitope was primarily found in RA samples of synovial origin (3 out of 4 
5 synovia samples). 



GM-CSF/IL-3/IL-5 receptor 

In 5 out of 8 samples from erosive RA patients an epitope from the P-chain of the 
multiple cytokine receptor CYRB could be identified: the 15-mer CYRB (359-373; SEQ. ID 

10 NO: 71) with the amino acid sequence ETMKMRYEHIDHTFE and its length variant, the 
17-mer CYRB (359-375; SEQ. ID NO: 72) (Table 2). In an in vitro binding assay the 
synthetic CYRB (359-375) peptide was shown to bind to HLA-DR4 molecules with 
moderate affinity (Fig. 3). This is in agreement with TEPITOPE which predicted a 
moderate peptide binding motif with respect to DRB1*0401 binding (TEPITOPE binding 

15 score: 3%). The epitope was clearly overrepresented in the RA samples since it was 
identified in the serum of only one healthy test person. 

The GM-CSF/IL-3/IL-5 receptor is a type I membrane protein and differentially 
expressed throughout the hematopoietic system (reviewed by Geijsen, N. et aL, Cytokine 
Growth Factor Rev 12 (2001) 19-25). Its ligands, IL-3 and GM-CSF, are secreted by CD4+ 

20 T cells and important stimuli for the formation of dendritic cells originating from 
progenitor cells in the bone marrow. The critical role of dendritic cells in the initiation and 
development of an immune response suggests that they play a key role in the development 
of autoimmune inflammatory disorders, such as RA, by transporting autoantigen to the 
draining lymph node where DCs encounter and prime naive T cells. The activity of GM- 

25 CSF has been linked to proinflammatory effects in RA supporting the identified receptor 
epitope CYRB (359-373; SEQ. ID NO: 71) as a putative candidate peptide marker for the 
diagnosis of RA. 



Sorting nexin 3 

30 Another epitope which appears to be indicative for erosive RA is derived from a 

protein termed sorting nexin 3 (SNX3): the 16^mer SNX3 (142-157; SEQ. ID NO: 73) 
having the amino acid sequence HMFLQDEIIDKSYTPS (Table 2). In this epitope 144F, 
147D and 1491 could serve as PI, P4 and P6 anchors, respectively, in the peptide-binding 
groove of the RA susceptibility allele DRB1 *0401 (TEPITOPE binding score: 2%). 
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Sorting nexins are a diverse group of cellular trafficking proteins that share a 
common phospholipid-binding motif (reviewed by Worby, C.A. & Dixon, J.E., Nat Rev 
Mol Cell Biol 3 (2002) 919-31). The ability of these proteins to bind specific phospholipids, 
as well as their propensity to form protein-protein complexes, points to the involvement of 

5 these proteins in membrane trafficking and protein sorting. Sorting nexin 3 in particular is 
present in the cytosol and in endosomes and appears to be involved in membrane 
trafficking from early to recycling endosomes. Whether sorting nexin 3 plays a role in RA, 
e.g. by influencing antigen presentation routes, remains unknown at the moment. The 
epitope SNX3 (142-157; SEQ. ID NO: 73) identified in this study suggests a link between 

10 sorting nexins and autoimmunity. 



Example 3 

All peptide sequences identified in examples 1 and 2 from non- erosive and erosive 
RA samples were used in this example to search for common markers relevant for both RA 
15 types. The RA-specific sequences were again compared with peptide sequences of the 
control samples (impulsed DCs and DCs pulsed with the serum of two healthy test 
persons) and only those peptides were selected for further evaluation that re-occurred in at 
least three of altogether fifteen RA samples (erosive and non-erosive RA). 



20 Inter-alpha-trypsin inhibitor 

Ten out of eleven serum samples (erosive & non-erosive RA) gave rise to an epitope 
derived from the heavy chain H4 of the inter-alpha-trypsin inhibitor: ITIH4 (271-287) with 
the amino acid sequence of SEQ ID NO: 8 (Table 3). Apart from this major length variant 
of the ITIH4 epitope, six length variants of the same ITIH4 epitope could be identified 
25 (Table 3): the 19-mer ITTH4 (271-289; SEQ ID NO: 6), the 18-mer ITIH4 (271-288; SEQ 
ID NO: 7), the 16-mer ITIH4 (274-289; SEQ ID NO: 12), the 15-mer ITIH4 (273-287; SEQ 
ID NO: 10), the 15-mer (274-288; SEQ ID NO: 11) and the 14-mer ITIH4 (274-287; SEQ 
ID NO: 9). 

As judged from the shortest length variant, ITIH4 (274-287), the epitope contains a 
30 very strong binding motif, with regard to binding to the RA susceptibility allele 
DRB1*0401: 277V serves as a PI anchor, 280D as a P4 anchor and 282S as a P6 anchor 
(binding score: 1%). 
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ITIH4 belongs to the Inter-alpha-inhibitor (Iod) family which is a group of serum 
protease inhib itors that bind to hyaluronic acid (HA) and appear to be involved in acute- 
phase reactions (Salier, J.P. et al., Biochemical Journal 315 (1996) 1-9). 

HA is a polysaccharide found in all tissues of the body, in particular, in loose 
5 connective tissue, e.g. joint fluid (Evered, D. & Whelan, J. eds., The Biology of Hyaluronan, 
John Wiley & Sons (1989)). HA has an important structural function in cartilage and other 
tissues where it stabilizes the extracellular matrix by forming aggregates with proteoglycans. 
It has also been assigned important biological functions by regulating cellular activities via 
binding to cell surface proteins, such as CD44 and ICAM-1 (Knudson, C.B. & Knudson, 
10 W., FASEB J 7 (1993) 1233-1241; Hall, C.L. et al., J Cell Biol 126 (1994) 575-588). RA is 
accompanied by a large increase in total HA in the joint fluid as well as in the serum, 
suggesting that circulating HA originates from rheumatoid joints (Engstrom-Laurent, A. et 
al., Scand J Clin Lab Invest 45 (1985) 497-504). 

Complexes of HA and some loci family members were observed in large amounts in 
15 the synovial fluid of RA patients (Jessen, T.E. et al., Biological Chemistry Hoppe-Seyler 375 
(1994) 521-526). The role of the Iotf-HA complex in inflammatory reactions might be to 
modify the CD44-HA interaction that mediates leukocyte activation and invasion (Isacke, 
CM. 8c Yarwood, H., Int J Biochem Cell Biol 34 (2002) 718-721). Additionally, synovial 
fluid of RA patients contains elevated levels of TSG-6, an anti-inflammatory glycoprotein 
20 and a member of the hyaladherin family of HA-binding proteins (Wisniewski, H.G. et al., J 
Immunol 151 (1993) 6593-6601). It has been shown that a complex of TSG-6 with led 
family members inhibits the activity of plasmin, a central molecule in the activation of 
inflammation-associated enzymes (Wisniewski, H.G. et al., J Immunol 156 (1996) 1609- 
1615). A regulation of plasmin activity by several acute-phase plasma proteins, namely 
25 TSG-6 and Iod family members, may prove to be important in RA, given the high contents 
of HA, TSG-6 and led family members in synovial fluid of inflamed joints. 

This evidence, together with the identification of multiple length variants of the same 
epitope and a strong HLA-DR4 binding motif, convincingly support the validity of the 
presented ITIH4 epitope as a serum-derived RA candidate marker. 

30 

Complement C4 

In eight out of eleven RA sera (erosive and non-erosive) tested, another dominant 
epitope was identified which is derived from complement C4: the 15-mer C4 (1697-1711) 
with the amino acid sequence of SEQ ID NO: 13 (Table 3). Five additional length variants 
35 of the same epitope could be found (Table 3): the 12-mer C4 (1697-1708; SEQ ID NO: 18), 
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the 13-mer C4 (1698-1710; SEQ ID NO: 17), the 14-mer C4 (1697-1710; SEQ ID NO: 15), 
the 16-mer C4 (1697-1712; SEQ ID NO: 14) and the 18-mer C4 (1697-1714; SEQ ID NO: 
16). Moreover, the presented epitope displays a very strong DRB1*0401 binding motif: 
1700Y as PI anchor, 1704D as P2 anchor and 1706N as P6 anchor (binding score: 1%). 

5 C4 which constitutes approximately 0.5% of plasma protein mass plays a critical role 

in the triggering of the central pathway of the complement system. The protein is 
synthesized as a single-chain precursor and, prior to secretion, is enzymatically cleaved to 
form a trimer of non-identical cc-, P-, and y-chains. The identified epitope C4 (1697-1711) 
is located at the very C-terminus of the C4 y-chain. The C4 cc-chain is further 
10 proteolytically degraded by activated CI to form the C4a anaphylatoxin, which is a 
mediator of local inflammatory processes (Moon, K.E. et al., J Biol Chem 256 (1981) 8685- 
8692). 

In general, the complement cascade is involved in the induction and progression of 
inflammatory reactions and is a major defense system against various pathogenic agents, 
15 including bacteria, viruses and other antigens (Morgan, B.P., Methods Mol Biol 150 (2000) 
1-13). Inappropriate activation, however, can lead to tissue damage and manifestation of 
disease (Speth, Q et al., Wien Klin Wochenschr 111 (1999) 378-391). 

Activation of the complement system has been repeatedly implicated in the 
pathogenesis of RA, based on studies showing increased levels of complement metabolites, 

20 including C4 and C4a, in plasma, synovial fluid and synovial tissue of RA patients 
(Neumann, E. et al., Arthritis Rheum 46 (2002) 934-945). In addition collagen-induced 
arthritis (CIA) in mice is characterized by the presence of complement activation products 
(Linton, S.M. & Morgan, B.P., Mol Immunol 36 (1999) 905-914). CIA is prevented after 
treatment with anti-C5 monoclonal antibodies (Wang, Y. et al., PNAS 92 (1995) 8955- 

25 8959) or with soluble CR1, an inhibitor of the complement system, delivered by gene 
therapy (Dreja, H. et al., Arthritis Rheum 43 (2000) 1698-1709). Activation of complement 
factors in joints is possibly induced by the presence of various immune complexes and it 
was hypothesized that stimulation of the innate immune system by infectious agents and 
cytokines may contribute to the initiation of RA (Friese, M.A. et al., Clin Exp Immunol 121 

30 (2000) 406-414). 

Two of the six presented C4 epitopes, the 15- and the 18-mer, were also identified in 
healthy control samples (Table 3) indicating that only some length variants of this C4 
epitope are RA-specific, namely the antigenic peptides of SEQ ID NOs: 14, 15, 17, and 18. 



35 Complement C3 
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Another epitope that was found in erosive and non-erosive RA samples is derived 
from complement C3 (alpha-chain): the 14-mer C3 (1431-1444) with the amino acid 
sequence of SEQ ID NO: 21 (Table 3). Six additional length variants of the same epitope 
were identified in serum (Table 3): the 13-mer C3 (1431-1443; SEQ ID NO: 23), the 14- 
5 mer C3 (1429-1442; SEQ ID NO: 74), the 15-mer C3 (1431-1445; SEQ ID NO: 22), the 15- 
mer C3 (1429-1443; SEQ ID NO: 20), the 17-mer C3 (1427-1443; SEQ ID NO:75) and the 
19-mer C3 (1426-1444; SEQ ID NO: 19). As judged from the shortest length variant, C3 
(1431-1443), a DRB1*0401 binding motif can be postulated: 1434Y serves as a PI anchor, 
1437D as a P4 anchor and 1439A as a P6 anchor. 

10 In erovise and non-erosive RA samples a further epitope was found, which is derived 

from complement C3 (beta-chain): the 19 mer C3 (157-175) with the amino acid sequence 
of SEQ. ID NO: 76 (Table 3). One additional length variant of the same epitope was 
identified in serum (Table 3): 20-mer C3 (157-176; SEQ. ID NO: 77) 

Complement C3 which constitutes about 1-2% of plasma protein mass plays a central 
15 role in the activation of the complement system and belongs to the family of the acute- 
phase proteins. Its processing by C3 convertase to C3a anaphylatoxin and C3b is the central 
step in both the classical and alternative complement pathways (Barrington, R. et aL, 
Immunol Rev 180 (2001) 5-15). After activation, C3b can bind covalently, via a reactive 
thiolester, to cell surface carbohydrates or immune aggregates (Isaac, L. & Isenman, D.E., J 
20 Biol Chem 267 (1992) 10062-10069). The identified epitope C3 (1431-1444) is located at 
the C-terminus of C3b. 

As already discussed in the context of complement epitope C4 (1697-1711), there is 
increasing evidence for an important role of components of the complement cascade in the 
pathophysiology of RA. The result of this study, in which two major epitopes derived from 
25 complement C3 and C4 were identified in serum of RA patients, underlines the close link 
between the activated complement system and pathogenesis of RA. This coincidence makes 
a strong argument for the validity of the presented C3 / C4 epitopes as serum-derived 
candidate RA markers. 



30 SH3 domain-binding glutamic acid-rich-like protein 3 

Another epitope which was elucidated quite frequently in serum of RA patients (six 
out of eleven erosive and non-erosive RA samples), is derived from the SH3 domain- 
binding glutamic acid-rich-like protein 3 (SH3BGRL3): SH3BGRL3 (15-26) with the 
amino acid sequence of SEQ ID NO: 25 (Table 3). Three length variants of the same 
35 epitope were identified (Table 3): the 14-mer SH3BGRL3 (13-26; SEQ ID NO: 26), the 14- 
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mer SH3BGRL3 (15-28; SEQ ID NO: 27) and the 16-mer SH3BGRL3 (13-28; SEQ ID NO: 
24). The DRB1*0401 binding motif is: 171 as PI anchor, 20Q as P4 anchor and 22S as P6 
anchor (binding score 4%). 

SH3BGRL3 is a small 10 kD protein that belongs to the SH3BGR family. The precise 
5 function of the protein is unknown but a role as a modulator of glutaredoxin biological 
activity is postulated (Mazzocco, M. et aL, Biochem Biophys Res Commun 285 (2001) 540- 
545). So far, SH3BGRL3 has not been described in the context of RA. 

Interestingly, the analysis elucidated a second epitope of the same protein, which was 
highly abundant in all RA and control samples: the 16-mer SH3BGRL3 (29-44) with the 

10 amino acid sequence DGKRIQYQLVDISQDN. In addition multiple length variants of the 
same epitope were found in most samples as well. As judged from the shortest length 
variant, SH3BGRL3 (31-42), the epitope contains almost similar DRB1*0401 anchor 
residues compared with SH3BGRL3 (15-26): 331 serves as a PI anchor, 36Q as a P4 anchor 
and 38V as a P6 anchor (binding score -2). This similarity is reflected by comparable 

15 binding scores. 

The presence of this second SH3BGRL3 epitope supports the validity of the 
SH3BGRL3 (15-26) epitope because both peptides are derived from the same protein, 
however, only one of them, epitope SH3BGRL3 (15-26), appears to be generated in a RA- 
specific maimer. A similar observation has been described already for GILT in example 1. 

20 Among the four SH3BGRL3 length variants the longest variant, SH3BGRL3 (13-28), 

was also identified in a healthy control sample (Table 3). However, this particular length 
variant was found only one time, which indicates a significant enrichment of the 
SH3BGRL3 epitope in the context of RA. 



25 Interleukin-4 (B>4) induced protein 1 

In all the investigated synovial fluids (erosive & non-erosive RA) and in eight out of 
eleven sera (erosive 8c non-erosive RA), one highly dominant epitope was identified which 
is derived from the human homolog of the IL-4 induced protein 1 (Figl): Figl (293-309) 
with the amino acid sequence of SEQ ID NO: 28 (Table 3). The validity of the epitope was 
30 further supported by the presence of additional length variants in several samples (Table 3): 
the 16-mer Figl (293-308; SEQ ID NO: 30) and the 19-mer Figl (293-311; SEQ ID NO: 
29). Moreover, the amino acid sequence displays a typical DRB1*0401 binding motif: 299V 
serves as PI anchor, 302E as P4 anchor and 304S as P6 anchor (binding score 1%). 
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Two length variants of the same epitope, Figl (293-308) and Figl (293-309), were 
identified in one unpulsed sample and in one healthy control sample as well (Table 3). 
However, the presence of the Figl epitope in almost all RA samples but not in all of the 
control samples tested strongly indicates an enrichment in the context of RA. 

5 The human figl gene was first identified in IL-4-stimulated B cell cultures (Chu, C.C. 

& Paul, W.E., PNAS 94 (1997) 2507-2512). The human figl resides on chromosome 
19ql3.3-19ql3.4, a region previously identified to be involved in susceptibility to 
autoimmune diseases, including SLE, arthritis, multiple sclerosis, and insulin-dependent 
diabetes mellitus (Becker K.G. et aL, PNAS 95 (1998) 9979-9984). Since its expression is 

10 largely limited to immune tissues and its regulation is dependent on IL-4, a key modulator 
of the immune response, figl is thus an attractive candidate gene for autoimmune disease 
susceptibility (Chavan, S.S. et aL, Biochim Biophys Acta 1576 (2002) 70-80). The HLA- 
DR4-restricted presentation of a Figl epitope provides the first indication that Figl protein 
is produced and possibly involved in the disease development of RA. The Figl polypeptide 

15 has not been known as a marker for RA until now, and is considered as an important 
candidate marker for RA. 



Hemopexin 

Another RA candidate marker which was frequently identified in serum samples (ten 
20 out of eleven samples) and in synovia samples (two out of four samples) (erosive & non- 
erosive RA) is derived from hemopexin (HPX): HPX (351-367) with the amino acid 
sequence of SEQ ID NO: 32 (Table 3). Several length variants were found which support 
the validity of this epitope (Table 3): the 13-mer HPX (351-363; SEQ ID NO: 33), the 14- 
mer HPX (350-363; SEQ ID NO: 34), the 15-mer HPX (351-365; SEQ ID NO: 35), the 18- 
25 mer HPX (351-368; SEQ ID NO: 31) and the 18-mer HPX (350-367; SEQ ID NO: 78). 
Furthermore, the epitope contains a very strong DRB1*0401 binding motif: 3551 serves as a 
PI anchor, 358D as a P4 anchor and 360V as a P6 anchor (binding score: 1%). 

Two length variants of the same epitope, HPX (351-367; SEQ ID NO: 32) and HPX 
(351-365; SEQ ID NO: 35), could also be identified in healthy control samples (Table 3) 
30 indicating that only some length variants are specific for RA, namely the antigenic peptides 
of SEQ ID NOs. 31, 33, 34 and 78. 

HPX is a 60 kD plasma glycoprotein with a high binding affinity to heme (Miiller- 
Eberhard, U., Methods Enzymol 163 (1988) 536-565). It is mainly expressed in the liver, 
and belongs to the acute-phase proteins the synthesis of which is induced in an 
35 inflammatory situation. RA is a chronic inflammatory autoimmune disease and elevated 
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levels of several acute-phase proteins, including C-reactive protein and serum amyloid A, 
have been reported (Nakamura, R., J Clin Lab Anal 14 (2000) 305-313). HPX is responsive 
to the cytokines IL-1 and IL-6, which are upregulated in patients suffering from RA 
(Feldmann, M. & Maini, R.N., Rheumatology 38, Suppl 2 (1999) 3-7). 

5 HPX is the major vehicle for the transportation of heme in the plasma and its 

principal role is to prevent heme-mediated oxidative stress and loss of heme-bound iron 
(Tolosano, E. & Altruda, R, DNA Cell Biol 21 (2002) 297-306). It can protect cells against 
oxidative stress by inducing the expression of intracellular antioxidants such as heme 
oxygenase, metallothioneins and ferritin. Metallothioneins are cytosolic proteins that are 

10 expressed particularly in synovial fibroblasts (Backman, J.T. et al., Virchows Arch 433 
(1998) 153-160). There is significant experimental evidence for the presence of oxidative 
stress in the synovial tissue of RA patients (reviewed in: Schett, G. et al., Arthritis Res 3 
(2000) 80-86). Furthermore HPX was reported to promote proliferation of human T 
lymphocytes (Smith, A. et al., Exp Cell Res 232 (1997) 246-254). These studies render it 

15 likely that HPX belongs to the up-regulated proteins in serum and synovia of RA patients, 
thereby providing a rationale for the relevance of HPX (351-367) as a RA-specific candidate 
marker. 



Hsc70-interacting protein 

20 An epitope which was mosdy identified in serum samples (4 out of eleven erosive and 

non-erosive RA samples) and which is also related to stress responses is derived from the 
Hsc70-interacting protein Hip: Hip (83-98) with the amino acid sequence of SEQ ID NO: 
38 (Table 3). Two length variants of this epitope were identified (Table 3): the 18-mer Hip 
(83-100; SEQ ID NO: 36) and the 15-mer Hip (84-98; SEQ ID NO: 39). An additional 

25 length variant was discovered in one erosive synovia sample (Table 3): the 15-mer Hip (85- 
99; SEQ ID NO: 37). As judged from the shortest length variant Hip (84-98) a DRB1*0401 
binding motif attaining a moderate score of 8%, can be postulated: 891 as PI anchor, 92D 
as P4 anchor, 94D as P6 anchor. 

In the cytosol of eukaryotic cells, Hip and Hop proteins associate with Hsc70 in order 
30 to participate in the regulation of Hsc70 chaperone activity (Frydman, J. 8c Hohfeld, J., 

Trends Biochem Sci 22 (1997) 87-92). The 42 kD Hip protein binds to the ATPase domain 
of Hsc70. It was postulated that Hip might increase the half-life of the chaperone-substrate 
complex providing the molecular basis for an efficient cooperation of Hsc70 with 
downstream chaperone systems. Hsc70 and Hsp90 have been shown to cooperate during 
35 protein folding in vitro (Jakob, U. 8c Buchner, J., Trends Biochem Sci 19 (1994) 205-211; 
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Freeman, B.C. & Morimoto, R.I., EMBO J 15 (1996) 2969-2979) and to play a role in 
thermal denaturation (Schneider, C. et al., PNAS 93 (1996) 14536-14541). The Hsc70 and 
Hsp90 association with stress-adaptation ultimately links Hip. to stress responses, including 
the induction of heat shock proteins, in the synovial tissue of RA patients (reviewed in: 
5 Schett, G. et al., Arthritis Res 3 (2001) 80-86). 



Binding properties of ITIH4, C4, C3, SH3BGRL3, Figl, HPX and Hip 

In order to further investigate the binding properties of the above described antigenic 
peptides of ITIH4, C4, C3, SH3BGRL3, Figl, HPX and Hip to HLA-DR4 molecules, in 

10 vitro binding assays were performed (Fig. 3): According to their IC50 values against the 
reporter peptide HA (309-317), the synthetic peptides ITTH4 (274-287), SH3BGRL3 (13- 
26), and Figl (293-309) bind to HLA-DR4 with high affinity (Fig. 3). Moderate binding to 
HLA-DR4 was measured for the synthetic peptides C4 (1696-1709) and HPX (351-365). 
The synthetic peptide C3 (1431-1444) bound only weakly to HLA-DR4 which is in 

15 agreement with TEPITOPE scoring (9%). No binding was measured for the Hip (84-98) 
peptide (data not shown). 



Invariant chain (Ii) 

Among the candidate marker peptides that appear to be indicative for erosive and 
20 non-erosive RA is an epitope which is derived from the HLA-DR-associated invariant chain 
(Ii): the 15-mer Ii (110-124; SEQ. ID NO: 83) with the amino acid sequence 
ATPLLMQALPMGA1P (Table 3). Moreover, four length variants were identified: the 16- 
mer Ii (109-124; SEQ. ID NO: 81), the 17-mer Ii (109-125; SEQ. ID NO: 80), the 18-mer Ii 
(109-126; SEQ. ID NO: 79) and the 21-mer Ii (109-129; SEQ. ID NO: 82) (Table 3). The 
25 presence of length variants gives confidence in the identified peptide sequence which 
displays a moderate DRB1*0401 binding motif (TEPITOPE binding score: 5%) with 114L 
serving as a PI anchor, 117A as a P4 anchor and 119P as a P6 anchor. The epitope was 
identified in two erosive and three non-erosive RA samples. Two Ii length variants were 
found in one unpulsed sample and two samples of healthy test persons. 

30 Peptide loading of MHC class II molecules is regulated by two accessory molecules in 

the class II pathway, invariant chain Ii and HLA-DM (reviewed by Bakke, O. 8c Nordeng 
T.W., Immunol Rev 172 (1999) 171-187 and Kropshofer, H. et al., Immunol Today 18 
(1997) 77-82). Ii trimers bind to nascent MHC class II molecules in the rough endoplasmic 
reticulum (ER) and block the peptide-binding groove, stabilizing the class II molecule and 
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preventing the binding of ligands available in the ER. Class H/Ii complexes are transported 
via the Golgi apparatus to endosomes, where Ii is degraded to a nested set of class II- 
associated Ii peptides (CLIP). The release of CLIP enables endosomal peptides to bind to 
the MHC class II molecules which are presented to T cell receptors on the cell surface. The 
5 identified epitope Ii (110-124) overlaps with the C-terminal part of CLIP. Interestingly, a 
reduced interaction of CLIP with RA-associated HLA-DR alleles was described (Patil, N.S. 
et aL, J Immunol 167 (2001) 7157-7168), indicating that a reduced interaction may 
contribute to the pathophysiology of autoimmunity in RA. 



10 Retinoic acid receptor responder protein 2 

In 4 out of 11 serum samples (erosive & non-erosive RA) an epitope was identified 
which is derived from retinoic acid receptor responder protein 2 (RARRES2): the 22-mer 
RARRES2 (40-61; SEQ. ID NO: 86) with the amino acid sequence 
HPPVQWAFQETSVESAVDTPFP (Table 3). In the C-terminal part of the epitope two 
15 length variants could have been identified: the 23-mer RARRES2 (40-62; SEQ. ID NO: 84) 
and the 24-mer RARRES2 (40-63; SEQ. ID NO: 85) (Table 3). With 45W as PI anchor, 
48Q as P4 anchor and SOT as P6 anchor, the epitope displays a moderate binding motif in 
the context of DRB1*0401 (TEPITOPE binding score: 3%). 

RARRES2 is a small 18.6 kD protein and mostly expressed in the endothelium and 
20 epidermis. The expression appears to be hormone dependent and a response to retinoic 
acid in skin and some osteotrophic hormones in marrow-derived stromal cells was 
identified (Nagpal, S. et aL, J Invest Dermatol 109 (1997) 91-95 and Adams, A.E. et aL, J 
Cell Biochem 74 (1999) 587-595), The function of RARRES2 is largely unknown and an 
association with RA, for instance through impaired osteoclastogenesis, is imaginable. 

25 

Fibronectin 

Another HLA-DR4 associated peptide which was exclusively found in 3 out of 4 
synovia samples (erosive & non-erosive RA) is derived from fibronectin (Fn), a major 
glycoprotein in blood plasma and in the extracellular matrix: the 15-mer Fn (1881-1895; 
30 SEQ. ID NO: 90) with the amino acid sequence IYLYTLNDNARSSPV (Table 3). The 
epitope appears to be a very good DRB1*0401 binder with 1885Y serving as a hydrophobic 
PI anchor, 1888N as a P4 anchor and 1890N as a P6 anchor (TEPITOPE binding score: 
1%). Four length variants were additionally identified: the 16-mer Fn (1881-1896; SEQ. ID 
NO: 91), the 16-mer Fn (1880-1895; SEQ. ID NO: 88), the 17-mer Fn (1881-1897; SEQ. ID 
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NO: 89) and the 17-mer Fn (1880-1896; SEQ. ID NO: 87), strongly supporting the validity 
of the identified Fn epitope (Table 3). 

Fibronectin plays an important role in cell adhesion, cell motility and in 
opsonization. It binds collagen and fibrin and mediates adhesion of fibroblasts to collagen 
5 fibrils. Fibronectin is strongly expressed in the synovial lining layer of both RA and 
osteoarthritis patients and correlates with hyperplasia of diseased joints. The identification 
of epitope Fn (1881-1895) in only samples of synovial origin is in line with these findings 
and points to a putative role of highly abundant fibronectin in autoimmunity. 



10 Cathepsin B 

Three out of fifteen RA samples (erosive & non-erosive RA) gave rise to an epitope 
which originates from Cathepsin B (CatB): the 15-mer CatB (227-241; SEQ. ID NO: 92) 
with the amino acid sequence YNSYSVSNSEKDIMA (Table 3). The peptide displays a 
moderate DRB1*0401 binding motif with 230Y as a hydrophobic PI anchor and the serine 
15 residues at positions 233 and 235 as putative P4 and P6 anchors (TEPITOPE binding score: 
6%). 

Cathepsin B is a thiol peptidase and believed to participate in intracellular 
degradation and turnover of proteins, e.g. collagen. It is located in lysosomes of different 
cell types including leukocytes. In the context of inflammation and disease it was shown 
20 that Cathepsin B contributes to cartilage destruction in osteoarthritis and pathological 
proteolysis in RA and cancer (Cuimane, G. et al., Arthritis Rheum 44 (2001) 1744-1753). 
Enzyme activities of Cathepsin B and other lysosomal peptidases correlate with RA 
progression which supports the validity of epitope CatB (227-241) as a putative RA marker 
peptide (Sohar, N. et al., Biol Chem 383 (2002) 865-869). 

25 

Tripeptidyl-peptidase II 

Another peptidase-derived epitope which was found in three out of fifteen RA 
samples (erosive & non-erosive RA) is derived from tripeptidyl-peptidase II (TPP2): the 
15-mer TPP2 (970-984; SEQ. ID NO: 93) having the amino acid sequence 
30 AGSLTLSKTELGKKA (Table 3). Additionally, the length variant TPP2 (970-985; SEQ. ID 
NO: 94) was identified. The peptidase epitope contains a typical DRB1*0401 binding motif 
with 973L, 976S and 978T as PI, P4 and P6 anchors respectively (TEPITOPE binding score: 
3%). 
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Similar to Cathepsin B, tripeptidyl-peptidase II is involved in lysosomal protein 
degradation. The identification of epitope TPP2 (970-984) in the context of RA provides 
the first indication that TPP2 might be implicated in impaired protein degradation in the 
context of inflammation and RA. 

5 

Legumain 

Legumain (LGMN) completes the set of three peptidases which were found in this 
investigation. The identified epitope LGMN (99-112; SEQ. ID NO: 95) with the amino acid 
sequence VPKDYTGEDVTPQN (Table 3) displays a typical binding motif with respect to 
10 the HLA allele DRB1*0401 in which 103Y could serve as a PI anchor, 106E as a P4 anchor 
and 108V as a P6 anchor (TEPITOPE binding score: 1%). 

Legumain occurs in endosomal and lysosomal fractions of antigen-presenting cells 
such as dendritic cells (Schwarz, G. et al, Biol Chem 383 (2002) 1813-1816). It has a strict 
specificity for the hydrolysis of asparaginyl bonds and was shown to play an important role 

15 in the processing of bacterial antigens for MHC class II presentation (Manoury, B. et al., 
Nature 396 (1998) 695-699). Whether Legumain is involved in autoimmune diseases, such 
as RA, remains unknown but the fact that three lysosomal peptidases together with GILT 
and the 26S proteasome were identified in this investigation suggests that the protein 
degradation machinery facilitating antigen processing might be significantly altered in the 

20 context of RA. 



Platelet activating factor receptor 

An epitope which was identified in three out of eleven serum RA samples (erosive 8c 
non-erosive RA) is derived from a receptor for platelet activating factor (PAFR): the 13- 
25 mer PAFR (264-276; SEQ. ID NO: 96) with the amino acid sequence DSKFHQAINDAHQ 
(Table 3). According to TEPITOPE scoring (3%) the epitope contains a moderated 
DRB1*0401 binding motif with 267F as PI anchor, 270A as P4 anchor and 272N as P6 
anchor (Table 3). 

Platelet activating factor (PAF) is a pro-inflammatory lipid mediator which binds to a 
30 G-protein-coupled seven transmembrane receptor on the surface of a broad range of cell 
types. By receptor binding, PAF transduces pleiotrophic functions which include cell 
motility, smooth muscle contraction, and synthesis and release of cytokines (reviewed by 
Honda, Z. et al., J Biochem 131 (2002) 773-779). Pharmacological studies and the 
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establishment of PAFR (-/-) mice have suggested that PAP functions in a variety of settings 
including allergy, inflammation, neural functions, reproduction, and atherosclerosis. 
Interestingly, PAF was found in elevated levels in the synovial fluid of RA patients and 
shown to induce neoangiogenesis, which is frequently observed in rheumatoid synovitis 
5 (Lupia, E. et al., Eur J Immunol 26 (1996) 1690-1694). This type of evidence gives further 
confidence in epitope PAFR (264-276) being a RA-associated marker peptide. 



Poly-alpha-2,8-sialylfransferase 

Another epitope which was found in three out of eleven serum samples (erosive & 
10 non-erosive RA) originates from Polysialyltransferase (PST): the 15-mer PST (333-347; 
SEQ. ID NO: 97), MPLEFKTLNVLHNRG (Table 3), displays a moderate binding motif 
with respect to DRB1*0401 binding (TEPITOPE scoring: 2%). 337F could serve as a PI 
anchor, 340L as a P4 anchor and 342V as a P6 anchor. 

Polysialic acid is a carbohydrate composed of a linear homopolymer of a-2,8-linked 
15 sialic acid residues. The glycan is mainly attached to the neural cell adhesion molecule (N- 
CAM) and implicated in many morphogenic processes of the neural cells by modulating 
the adhesive property of N-CAM. The membrane protein polysialyltransferase catalyzes the 
polycondensation of sialic acid residues and is highly expressed in fetal brain, lung and 
kidney, in adult heart, spleen and thymus, and to a lesser extent in peripheral blood 
20 leukocytes (Nakayama, J. et al., Proc Nad Acad Sci 92 (1995) 7031-7035). Elevated PST 
levels were observed in serum of patients with metastatic tumors and an increased enzyme 
activity was also associated with rheumatoid arthritis (Berge, P.G. et al., Klin Wochenschr 
60 (1982) 445-449). The identification of epitope PST (333-347) in several RA samples is in 
line with these observations and supports the putative role of PST in RA. 

25 

Ras-related protein Rab-1 IB 

The last epitope presented in this analysis of RA samples is derived from the Ras- 
related protein Rab-llB: the 13-mer Rab-llB (51-63; SEQ. ID NO: 102) with the amino 
acid sequence RSIQVDGKTTKAQ (Table 3, the peptide sequence were validated with 
30 regard to binding to the RA susceptibility allele D£Bi*0301 by using the TEPITOPE 
software (Hammer, J. et al., Adv Immunol 66 (1997) 67-100)). This epitope and four 
additional lenght variants, the 14-mer Rab-1 IB (50-63; SEQ. ID NO: 100), the 15-mer Rab- 
llB (49-63; SEQ. ID NO: 98), the 17-mer Rab-llB (49-65; SEQ. ID NO: 101) and the 18- 
mer Rab-llB (49-66; SEQ. ID NO: 99) were found in three erosive and two non-erosive 
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RA. samples (Table 3). The 15-mer Rab-llB (49-63) was identified in 1 out of 12 control 
samples? " 

.Rab proteins are small GTPases that play an important role in membrane trafficking 
along the endo- and exocytic pathway. Rab-llB is assumed to be essential for the transport 
5 of internalized transferrin from the recycling compartment to the plasma membrane. 
Furthermore Rab-llB was identified in rat osteoclasts and might play an additional role in 
bone resorption. The epitope Rab-llB (51-63) provides the first indication of a role of 
Rab-llB in RA development. 
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